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» 

USE OF TRANSLATIONALLY ALTERED RNA TO CONFER RESISTANCE TO MAIZE 
DWARF MOSAIC VIRUS AND OTHER MONOCOTYLEDONOUS PLANT VIRUSES 

The invention relates generally to the genetic engineering of monocotyledonous 
plants to resist virus infection through the expression of inhibitory transcripts or proteins 
derived from the inhibited virus. In another aspect, the invention relates to the elucidation 
and characterization of the genomic structure and organization of a maize dwarf mosaic 
virus. 

Plant viruses are a major problem in agriculture and cause significant losses in crop 
yield each year. In the past, available approaches for combating plant viruses were primarily 
limited to the selection of plant lines which exhibited genetic resistance to virus infection 
and the application of chemicals designed to protect plants from the organisms responsible 
for introducing the virus to the plant (i.e. viral vectors). 

Recently, a number of approaches for combating plant viruses have been developed 
which are based upon the transformation of susceptible plant species with chimeric genes 
which express transcripts or proteins that inhibit viral infection. These approaches include 
genetically engineering plants to express viral coat protein or coat protein transcripts, viral 
replicases in unmodified or modified form, antisense genes or ribozymes targeting viral 
genomic RNA or transcripts, and altered viral transcripts (for a review, see Fitchen, J.H. et 
al. a Ann. Rev. MicrobioL 47: 739-763 (1993)). To apply any of these approaches, knowledge 
of the structure and organization of the genome of the target virus is necessary. 

With respect to the expression of altered viral transcripts to confer viral resistance, 

« 

limited success has been reported in dicotyledonous plants through the expression of viral 
coat protein transcripts which have been modified to render them incapable of translation. 
Expression of such "untranslatable" viral transcripts in tobacco has been reported to inhibit 
tobacco etch virus (Lindbo, J.A. et aL, Mol. Plant-Microbe Int. 5(2): 144-153 (1992); Lindbo, 
J.A. et al. t Virology 189: 725-733 (1992); WO 93/17098 to Dougherty, W.G. et al. (Sept. 2, 
1993); Lindbo, J.A. et al., The Plant Cell 5: 1749-1759 (1993)), tomato spotted wilt virus 
(Pang, S. et al., Bio/Technology 11: 819-824 (1993): DeHaan et al., Bio/Technology 10: 
1133-1137 (1992) and potato virus Y (Van der Vlug R.A. et al., Plant Mol. Biol. 17: 431- 
439 (1991). 
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The ability of such untranslatable RNAs to inhibit viral infection does not appear to be 
universal, however. Failure of such altered viral transcripts to inhibit viral infection have 
been reported for tobacco mosaic virus (Powell, P.A. et al., Virology 175: 124-130 (1990) 
and zucchini yellow mosaic virus (Fang, G. et al M Mol. Plant-Microbe Int. 6(3J: 358-367 
(1993), a potyvirus similar to tobacco etch virus. Additional unreported failures may also 
exist, since such negative results are rarely published. 

The most prevalent virus infecting maize in the United States and Europe is maize 
dwarf mosaic virus (MDMV). This virus is classified as a member of a family of plant viruses 
known as the potyviruses. The potyviruses are the largest group of plant viruses and are 
characterized by a long, flexuous rod particle morphology and are non-persistently 
transmitted by aphid vectors (see Hollings, M. and Brunt, A., pages 732-807 of "Handbook 
of Plant Virus Infection and Comparative Diagnosis", ed. by E. Kurstak, pub. by 
Elsevier/North Holland Biomedical Press, Amsterdam (1981)). The potyviruses have a 
genome composed of a single strand positive sense messenger RNA molecule which is 
transcribed and translated as one polyprotein that is subsequently cleaved into its 
component parts. The family is composed of many taxonomic strains, with the two most 
common being strains A and B. These strains are differentiated by the ability of MDMV-A to 
infect johnsongrass which is the overwintering host. MDMV-A is primarily localized to the 
southeastern United States due to the occurrence of johnsongrass in this area. MDMV-B is 
more widespread and can be found in the U.S. corn belt and throughout Europe (i.e. Spain, 
France, and Italy). MDMV-B is the most economically important maize virus due to its 

widespread occurrence. 

Viral diseases of maize result in an estimated 5% annual yield reduction as well as 
reduce overall plant health which results in increased susceptibility to other pathogens. 
Experimental trials using common maize inbreds and hybrids have shown yield reductions 
from MDMV as great as 35% in inoculated plots. MDMV is a major crop pest in maize where 
it causes mosaic symptoms and dwarfing of infecting plants, ultimately reducing crop yields 
(Knoke, J.K. et al., pages 235-281 of "Diseases of Cereals & Pulses", volume I, ed. by 
Singh, U.S. et al., pub. by Prentice Hall, Englewood Cliffs, NJ (1992)). When found in 
combination with maize chlorotic mottle virus (MCMV), a synergistic condition known as corn 
lethal necrosis results causing even more severe crop damage (see Uyemoto, J.K.. pages 
141-143 of "Proc. Intl. Maize Virus Disease Colloq. & Workshop", ed. by Gordon, D.T, et 
al, pub. by Ohio State Univ. and Ohio Agric. Res. Dev. Center, Wooster, MA (1983). 
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The economic impact of yield losses due to MDMV has generated considerable 
interest in developing strategies to combat this virus. To date, however, only limited success 
has been achieved in reducing the adverse impact of this virus. Thus there remains a need 
to identify additional effective means for protecting host plants from MDMV. 

Both strains A and B of MDMV are transmitted in nature by aphids in a non-persistent 
manner, thus insect control is not a practical control method. The most effective method of 
control of these diseases is the use of resistant germplasm. In maize, sources of resistant 
germplasm exist to both strains of the MDMV, but the efficacy of the resistance is somewhat 
variable and identification of this material can be difficult. Studies have shown that 
resistance to MDMV is not the result of a single, dominant gene, but rather being multigenic 
(2-5 genes). There has been an abundance of research on the development of alternative 
strategies for conferring resistance in transgenic plants. Most of these strategies have 
focused on the expression of viral genes (i.e. the viral coat protein) in plants as a means of 
conferring resistance. The benefits of these strategies are that the resistance can be 
developed to viruses in which effective natural resistance can not be identified and the 
resistance is more easily transferred to agronomically desirable plant lines. The majority of 
this work has focused on coat protein mediated resistance which is based on the 
expression of the viral coat protein in the plant. Coat protein mediated resistance has been 
particularly effective for some viruses (e.g. tobacco mosaic virus) but inconsistent for other 
viruses (e.g. potyviruses) when tested in model systems such as tobacco and in 
economically important grain crops such as maize, wheat, and rice. 

More recently, another virus resistance strategy has been developed which conferred 
an immune phenotype in plants transformed with segments of virus sequence. The 
phenomenon has been termed RNA-mediated resistance and is thought to be similar to 
sense suppression or co-suppression described in other plant systems. Specifically, plants 
were transformed with a sequence encoding the virus coat protein which had been modified 
to cause premature termination during translation. The expression of this untranslatable 
viral coat protein sequence at high levels was hypothesized to activate a RNA degradation 
cycle which eliminated the transgene mRNA in a sequence specific manner. The pathway 
was then believed to be capable of also eliminating an infecting virus which contains 
sequence highly homologous (>90%) to the transgene sequence. Since the original 
description of RNA-mediated resistance (see Lindbo, J.A et al.. Mol. Plant-Microbe Int. 5(2): 
144-153 (1992) and DeHaan et al.. Bio/Technology 10: 1 133-1 137 (1992)), there have 
been additional descriptions of this form of resistance. Furthermore, it has been shown that 
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prior work thought to be resistance due to expression of a viral protein is more likely to be 
RNA-mediated resistance. However, this strategy has not been effective for all viruses (see 
Powell. P.A. et al.. Virology 175: 124-130 (1990) and Fang, G. et al., Mol. Plant-Microbe Int. 
6(3): 358-367 (1993)). The examples of RNA-mediated resistance have been limited to 
model dicot hosts such as tobacco and potato. It is not known if this resistance will be 
effective in monocots nor what factors will be necessary for induction of this resistance. 

The genomic structure and organization of MDMV has remained largely 
uncharacterized except for the elucidation of viral coat protein coding sequences (see 
Frenkel, M. J. et al. J. Gen. Virol. 72:237-242. (1991); see also Murray, L.E. et al., 
Bio/Technology 11: 1559-1564 (1993)). As a result, it is currently not possible to apply many 
of the more recent recombinant-DNA based approaches that have been used for combating 
plant viruses to MDMV. These approaches require a more extensive understanding of the 
structure and organization of the genome of the target virus than is currently available for 
MDMV. 

In one aspect, the present invention provides a method for protecting a 
monocotyledonous plant from infection by a virus by producing in such a plant an RNA 
molecule whose sequence corresponds, at least in part, to a mRNA or the plus strand RNA 
produced by the virus. The RNA molecule produced according to the method of the 
invention is modified so that it cannot be translated completely as compared to the viral 
RNA to which it corresponds. Included within this aspect of the invention are chimeric 
genes designed to express such modified RNA molecules in monocotyledonous plants, as 
well as monocotyledonous plants containing such chimeric genes stably integrated into their 
genome. Such plants and their progeny are protected from infection by monocotyledonous 
viruses that produce messenger or plus-sense RNA which share sequence identity with the 
modified RNA molecule encoded and expressed by the stably integrated chimeric gene. 

Another aspect of the invention is based upon structural and organizational 
information that has been elucidated for the genome of strain B ol Maize Dwarf Mosaic 
Virus (MDMV-B) upstream of the coat protein gene. Included in this aspect of the invention 
are chimeric genes designed to express coding sequences for MDMV-B proteins including 
the coat protein (nucleotides 7308-8291 of SEQ ID No. 1), the RNA dependent RNA 
polymerase (RdRp) (nucleotides 5745-7307 of SEQ ID No. 1). proteinase (nucleotides 
4452-5744 of SEQ ID No. 1), a 6K protein (nucleotides 4293-4451 of SEQ ID No. 1), 
cylindrical inclusion protein (CIP) (nucleotides 2376-4292 of SEQ ID No. 1). P3 proteinase 
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(nucleotides 1134-2375 of SEQ ID No. 1). and a portion of the helper components 
proteinase (HC-Pro) (nucleotides 3-1133 of SEQ ID No. 1). Methods for protecting plants 
from MDMV infection by transforming them with these chimeric genes are included w.th,n 
this aspect of the invention along with the resulting transgenic plants and their progeny. 

The MDMV-B coding sequences may also be modified according to the first aspect of 
the present invention so that the RNA derived therefrom cannot be properly translated. The 
present invention includes chimeric genes designed to express such translation^ altered 
MDMV-B RNAs in plants. Methods for protecting plants from MDMV infection by 
transforming them with these chimeric genes are included within this aspect of the invent-on 
along with the resulting transgenic plants and their progeny. 

The following sequences according to the invention are disclosed in the sequence listing: 

SEQ ID No. 1 : Sequence of the polycistronic messenger RNA of maize dwarf mosaic 

virus, strain B. 

SEQ ID NO. 2: Sequence of the polyprotein encoded by the polycistronic messenger RNA 

of maize dwarf mosaic virus, strain B. 
SEQ ID No. 3: First internal control alcohol dehydrogenase PCR primer used in analysis 

of To plants as described in Example 4. 
SEQ ID No. 4: Second internal control alcohol dehydrogenase PCR primer used in 

analysis of T 0 plants as described in Example 4. 
SEQ ID No. 5: First PCR primer for the synthetic PAT gene used in analysis of T 0 plants 

as described in Example 4. 
SEQ ID No. 6: Second PCR primer for the synthetic PAT gene used in analysis of T 0 

plants as described in Example 4. 
SEQ ID No. 7: First PCR primer for the Nla proteinase gene used in analysis of T„ plants 

as described in Example 4. 
SEQ ID No. 8: Second PCR primer for the Nla proteinase gene used in analysis of T„ 

plants as described in Example 4. 

For purposes of describing the present invention, the term "translationally altered 
RNA" is used to refer to a modified form of a naturally occurring messenger RNA sequence 
which cannot be completely translated compared to the unmodified, naturally occurring 
form A translationally altered RNA may be incapable of being translated at all or it may be 
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capable of being partially translated into an attenuated peptide corresponding to a portion 
of the peptide encoded by the naturally occurring messenger RNA sequence from wh.ch the 

translationally altered RNA is derived. 

The coding sequence for a naturally occurring viral RNA sequence may be mod.f.ed to 
encode a translationally atiered RNA, for example, by removing its ATG initiation codon or 
by utilizing a portion which does not include the initiation codon. Other means tor 
translationally altering a naturaHy occurring viral RNA molecule include introducing one or 
more premature stop codons and/or interrupting the reading frame. 

The basis for the present invention is two-fold. The first basis for the present 
invention is the discovery that reduced susceptibility to infection by a virus may be conferred 
upon a monocotyledonous plant by producing in the plant a translationally altered RNA 
molecule corresponding in sequence to a plus-sense or messenger RNA molecule of the 
target virus. The second basis for the present invention is the elucidation and 
characterization by the inventors of the genomic structure and organization of strain B of 
maize dwarf mosaic virus (MDMV-B). These two bases are addressed consecutively below 
and are both represented by the examples demonstrating resistance to MDMV-B v.a 
expression of a translationally altered RNA in a transgenic maize plant. 



The first aspect of the present invention is directed to a general method for reducing 
the susceptibility of a monocotyledonous plant to viral infection by producing in the plant a 
translationally altered RNA molecule corresponding to a messenger RNA sequence of the 
target virus. Viruses infecting monocotyledonous plants will be referred to as 
monocotyledonous viruses. A method is provided for protecting progeny of a 
monocotyledoneous parent plant from viral infection by transforming said parent plant with a 
chimeric gene comprising a monocotyledonous plant promoter operably linked to a 
nucleotide sequence derived from the genomic sequence of a virus inlecting 
monocotyledoneous plants, wherein said nucleotide sequence contains a modification 
rendering a messenger RNA transcribed from said nucleotide sequence incapable of 
complete translation, and obtaining progeny plants. Alternatively, said progeny of a parent 
plant can be protected from viral infection by bree^ g the parent plan, with a 
monocotyledonous plant having an inheritable trait ol resistance to infection due to its 
expression of a chimeric gene comprising a monocotyledonous plant promoter operably 



WO 97/02352 



-7- 



PCT/EP96/02673 



linked to a nucleotide sequence derived from the genomic sequence of a virus infecting 
monocotyledoneous plants, wherein said nucleotide sequence contains a modification 
rendering a messenger RNA transcribed from said nucleotide sequence incapable of 
complete translation 

The preferred approach for producing the translationally altered RNA molecule in a 
monocotyledonous plant is by introducing a chimeric gene designed to express this 
molecule into the genome of the plant. Such a chimeric gene will consist of a plant 
promoter operably linked to a nucleotide sequence derived from the genomic sequence of a 
virus infecting monocotyledoneous plants, wherein said nucleotide sequence contains a 
modification rendering a messenger RNA transcribed from said nucleotide sequence 

incapable of complete translation. 

The promoter component may be any monocotylodoneous plant promoter that is any 
promoter which is capable of regulating or directing the expression of an operably linked 
gene in the targeted monocotyledonous plant. Such promoters are well known in the art. 
Preferably, a promoter which is capable of directing strong constitutive expression is used. 
Such promoters include, but are not limited to, the maize ubiquitin promoter described in 
Toki et al., Plant Physiol. 100: 1503-1507 (1992) . the maize phosphoenolpyruvate 
carboxylase (PEPC) promoter as described in Hudspeth, R.L. et al., Plant Molec. Biol. 12: 
579-589 (1989), and the CaMV 35S promoter as described in Kay et al., Science 236: 
1299-1302 (1987). 

The coding sequence component comprises a sequence which, when transcribed, 
produces a translationally altered RNA molecule corresponding to a target viral sequence. 
The target viral sequence is a messenger RNA (rnRNA) molecule of the target virus, or a 
portion thereof. Since the target viral sequence is naturally translatable when a translation 
initiation codon is present, it is modified so as to render it translationally altered. For any 
given target viral sequence, the skilled artisan will be able to determine various 
modifications which could be made to render the resulting RNA molecule translationally 
altered. 

Translation of an rnRNA molecule in a plant cell generally requires the presence of an 
initiation AUG codon followed by an uninterrupted string of amino acid codons (known as 
the reading frame) ending with a translational stop codon, which may be either UAA. UAG 
or UGA. A DNA molecule encoding a translatable rnRNA molecule may be modified to 
encode a translationally altered RNA. for instance, by either removing the initiation ATG 
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codon. interrupting the reading frame, adding premature stop codons, or by a combination 

of these modifications. 

Introduction of one or more premature stop codons (encoded by DNA codons TAA, 
TAG or TGA) in a target viral sequence may be accomplished by adding or deleting 
nucleotides or by modifying existing nucleotides using standard techniques such as site 
directed mutagenesis or mutagenesis by PCR. Adding or deleting nucleotides may have 
the additional benefit of interrupting the reading frame, which also has the effect of 
translationally altering the RNA molecule. While the addition of a premature stop codon 
anywhere along the length of the target viral sequence will render it translationally altered 
as that term is used herein to describe the invention, it is preferable to introduce such stop 
codons near the 5' end of the target viral rnRNA so that any attenuated peptides which may 
be produced via partial translation are 20 amino acids or less in length. 

The reading frame of a target viral sequence may be interrupted by the addition or 
deletion of nucleotides in the DNA coding sequence. As with the addition of premature stop 
codons. it is preferable to interrupt the reading frame near the 5' end of the target viral RNA 
so that any attenuated peptides corresponding to a portion of the peptide encoded by the 
target viral RNA which may be produced via partial translation are 20 amino acids or less in 
length. 

Another way to translationally alter the target viral sequence is to remove the 
translation initiation codon, which will be an ATG. This may be accomplished simply by 
choosing a target viral sequence which does not include the translation initiation codon. 
Alternatively, this may be accomplished by disrupting the ATG codon either by adding, 
deleting or modifying nucleotides within this codon using standard techniques. 

Any messenger RNA molecule produced by the target monocotyledonous virus, or 
any portion of such a molecule, may be used as the target viral sequence. The target viral 
sequence is preferably at least 120 nucleotides in length, more preferably at least 250 
nucleotides in length, and most preferably at least 500 nucleotides in length. 

A translationally altered viral RNA according to the invention includes any modified 
form of a naturally occurring viral messenger RNA sequence which cannot be completely 
translated as compared to the unmodified, naturally occurring form. Thus a translationally 
altered viral RNA may either be incapable of being translated at all. or it may be capable of 
translating an attenuated peptide corresponding to a portion of the peptide encoded by the 
target viral sequence used as a template. 
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The inhibitory effect of a translationally altered viral RNA is contemplated to be based, 
at least in part, upon its effect on host cell degradation mechanisms. Production of a 
translationally altered viral RNA in a plant cell is contemplated to trigger one or more cellular 
RNA degradation mechanisms which target the translationally altered viral RNA, as well as 
any corresponding homologous unaltered viral RNA molecules which may be present in the 
cell (see. e.g. page 550 of Dougherty, W.G. et al.. Mol. Plant-Microbe Int. 7(5): 544-552 
(1994); Chasan, R.. The Plant Cell 6: 1329-1331 (1994)). 

The ability to translate an attenuated peptide, particularly a short peptide less than 20 
amino acids, is contemplated to enhance the triggering effect of the translationally altered 
viral RNA upon host cell RNA degradation pathways contemplated to play a role in inhibition 
of viral infection. Thus translationally altered RNAs which are capable of translating an 
attenuated peptide are preferred. More preferably, the translationally altered viral RNA is 
capable of translating an attenuated peptide less than 20 amino acids in length. For target 
viral RNAs which do not include a translation initiation codon. one may be added in 
conjunction with the addition of a premature stop codon or interruption of the reading frame 
to create a translationally altered RNA capable of translating an attenuated peptide (see. for 
example, the construct pCIB5018 described in Example 4). 

Target viral sequences may be selected from the group consisting of a potyvirus. 
luteovirus. tenuiivirus. carmovirus, machlovirus. geminivirus and reovirus sequences and 
may correspond to the coding sequence for any viral protein, such as a viral coat protein, 
replicase. proteinase, inclusion body protein, helicase. 6K protein and VPg. Such 
sequences are well known for several monocotyledonous viruses including, but not limited 
to, MDMV (see SEQ ID NO. 1). Sugarcane mosaic virus (partial sequence; see Frenkel, M. 
J. et al. J. Gen. Virol. 72:237-242. (1991)). Johnsongrass mosaic virus (partial sequence) 
(see Gough, K. H. et al., J. Gen. Virol. 68:297-304, (1987). maize chlorotic mottle virus (see 
Nutter. R. C. et al. Nucleic Acids Research 17:3163-3177. (1989)). maize chlorotic dwarf 
virus (see WO 94/21796), maize rough dwarf virus (partial sequence) (see Marzachi, C. et 
al. Virology 180:518-526. (1991)). maize stripe virus (partial sequence) (see Huiet. L. et al. 
Virology 182:47-53. (1991); Huiet. L. et al. J. Gen. Virol. 73:1603-1607. (1992); Huiet, L. et 
al. GenBank Accession Number L3446, (1993)), maize streak virus (see Mullineaux. P. M. 
et al EMBO J. 3:3063-3068, (1984)), barley yellow dwarf virus (see Larkins, B. A. et al. J. 
Gen. Virol. 72:2347-2355, (1991)). and wheat spindle streak virus (partial sequence) (see 
Sohn, A. et al. Arch. Virol. 135:279-292, (1994)). 



WO 97/02352 



- 10 - 



PCT/EP96/02673 



• Suitable host plants which may benefit from the production of translationally altered 
viral RNA such as altered MDMV RNA include any monocotyledenous species which are 
susceptible to viral infection, particularly infection by a member of the potyvirus family. In 
particular, suitable host plants inlcude maize, wheat, sugarcane and sorghum. 

In a preferred embodiment, the target viral sequence used is a coding sequence 
which is identical or highly homologous among two or more monocotyledonous viruses or 
virus strains. Expression of a translationally altered RNA in a monocotyledonous plant 
based on such a shared sequence is contemplated to inhibit infection by any of the viruses 
which produce a messenger RNA having homology with the target viral sequence. 



A second aspect of the present invention is based upon the elucidation and 
characterization by the inventors of the genomic structure and organization of strain B of 
maize dwarf mosaic virus (MDMV-B). Previously, only the genomic sequence of the MDMV- 
B coat protein was known (see Frenkel. M. J. et al.. J. Gen. Virol. 72: 237-242 (1991)). As a 
result of the disclosed invention it is now possible to apply many of the more recent 
recombinant-DNA based approaches that have been used for combating plant viruses to 
MDMV such as the use of chimeric genes comprising a plant promoter operably linked to a 
nucleotide sequence derived from the genomic sequence of maize dwarf mosaic virus stra.n 
B encoding a viral protein other than a coat protein, wherein transgenic expression of sa.d 
chimeric genes in a plant inhibits infection of said plant with said virus. 

The MDMV-B positive strand RNA genome is believed to be approximately 10.000 
bases in length based on the length of other potyviruses. The sequence of 8530 
nucleotides beginning at the 3' end of the MDMV-B genome is set forth in SEQ ID NO: 1. A 
single long open reading frame was identified within this sequence of the viral genome and 
the polyprotein amino acid sequence encoded by this open reading frame is provided in 
SEQ ID NO: 2. With the sequence information provided, this viral genome can be isolated 
and cloned using a variety of standard genetic engineering techniques well known to those 
of skill in the art. Three DNA fragments covering 85% of the MDMV-B genome have been 
cloned into a Bluescript II SK plasmid backbone (Stratagene). transformed and propagated 
in the E. coli cell line HB101. and deposited on June 29. 1995 with the Midwest Area 
National Center for Agricultural Utilization Research (formerly known as the National 
Regional Research Lab and still referred to by the corresponding acronym "NRRL"). One of 
the plasmids designated "1-47" and deposited under the accession No. NRRL B-21479 
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contains nucleotides 3252-8530 of the MDMV-B genome. Another plasmid designated "2- 
24 w and deposited under the accession No. NRRL B-21480 contains nucleotides 1866-3317 
of the MDMV-B genome. Yet another plasmid designated "9-1 -5" and deposited under the 
accession No. NRRL B-21481 contains nucleotides 1-2122 of the MDMV-B genome . 

The polyprotein encoded by the MDMV-B genome includes a single coat protein 
designated CP whose coding sequence extends from nucleotide 7308 to 8291 of SEQ ID 
No. 1 and whose amino acid sequence extends from amino acid 2436 to 2763 of SEQ ID 
No. 2. The MDMV-B polyprotein is also contemplated to include a replicase protein, three 
proteinases, a 6K protein, a helper component, proteins involved in viral movement in the 
host plant (both cell to cell and long distance transport), a helicase protein and a VPg 
protein. 

MDMV-B is contemplated to contain a serine-like proteinase analogous to serine-like 
proteinases that have been identified in related potyviruses. These serine-like proteinases 
have a characteristic catalytic domain of three amino acids consisting of a histidine at 
position 1 of the domain, an aspartic acid at the second position, and a cysteine at the third 
(see Bazan, J. F. and Fletterick, R. J., Proc. Natl. Acad. Sci. USA 85: 7872-7876 (1988)). 
These amino acids are separated in the primary amino acid sequence by a region spanning 
approximately 140 amino acids. The intervening sequences between each of the catalytic 
domain sequences exhibits additional limited homology among the known proteinases (see 
Bazan, J. F. and Fletterick, R. J.. Proc. Natl. Acad. Sci. USA 85: 7872-7876 (1988)). 
Based upon comparison with the known proteinase sequences, the MDMV-B proteinase 
catalytic domain is contemplated to span a 105 amino acid sequence from position 1718 to 
1823 of SEQ ID No: 2 with the three catalytic residues occurring at amino acids 1718, 1753, 

and 1823 of SEQ ID No. 2 

MDMV-B is also contemplated to contain a second proteinase analogous to the 
cysteine proteinases that have been identified in related potyviruses. The active-site 
residues form a catalytic diad made up of a conserved cysteine and histidine which are 
separated by 72 amino acids (see Oh, C. and Carrington, J. C. f Virology 173:692-699, 
(1989)). This proteinase is located within the carboxy-terminus ol the HC-Pro region of the 
potyvirus polyprotein. Based upon comparison with the known proteinase sequences of 
tobacco etch virus, the MDMV-B HC-Pro proteinase domain is contemplated to span a 74 
amino acid region from position 263 to 336 of SEQ r No: 2 with the two catalytic residues 
occurring at amino acids 263 and 336. 
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' The location of the MDMV-B putative helicase domain can be identified based on the 
homology with other known viral helicase domains (see Gorbalenya. A. E. et al.. Nucleic 
Acids Research 17 (12):47 13-4730, (1989)). The helicase domain consists of seven distinct 
highly conserved segments which correspond to the NTP-binding motif. The" primary 
consensus site consists of a glycine at position 1 of the motif, glycine at position 3. lysine at 
position 4. and either a serine or threonine at position 5 (see Gorbalenya, A. E. et al. supra). 
The conserved helicase domain is located in the MDMV-B genome within a region encoding 
the cylindrical inclusion protein (CIP) and is found from amino acids 880 to 1010 of SEQ ID 
No: 2. The conserved domain (GxGDS) is located at amino acids 883. 885, 886. and 887 
of SEQ ID No: 2. 

The coding sequence for the replicase gene of MDMV-B may also be determined by 
the location of conserved motifs common to viral replicase genes and by identification of 
putative viral proteinase cleavage sites bordering the replicase coding sequence. 
Conserved motifs have been found in other viral replicases. In particular, the conserved 
amino acid motif GDD (known as domain C) is the hallmark consensus sequence for all 
RNA- dependent replicases (Poch et al. EMBO 8: 3867-3874 (1989)). This conserved motif 
is found at amino acids 2266-2268 in the MDMV-B open reading frame (SEQ ID No: 2). 
Two additional conserved motifs characteristic of a plant viral replicase have been identified 
and designated as domain A and B (Poch et al., supra). Domain A is a 17 amino acid 
sequence with two centrally conserved amino acids which are present in the MDMV-B 
genome at amino acids 2163 and 2168 of SEQ ID No: 2. Domain B is a 10 amino acid 
sequence consisting of 5 conserved amino acids which are present in the MDMV-B genome 
at amino acids 2222, 2223, 2224. 2225 and 2226 of SEQ ID No: 2. 



The isolated MDMV-B genomic sequences taught by the present invention are 
particularly useful for the development of viral resistance in susceptible host plants. With 
the information provided by the present invention, several approaches for inhibiting plant 
virus infection in susceptible plant hosts which involve expressing in such hosts various 
inhibitory transcripts or proteins derived from the target virus genome may now be applied 
to MDMV. 

Use of translationally altered RNA in a method for producing a monocotyledonous 
plant with an inheritable trait of resistance to infection by a maize dwarf mosaic virus 
comprising transforming said plant with a chimeric gene comprising a monocotyledonous 
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plant promoter operably linked to a nucleotide sequence derived from the genomic 
sequence of a maize dwarf mosaic virus, wherein said nucleotide sequence contains a 
modification rendering a messanger RNA transcribed from said nucleotide sequence 
incapable of complete translation, may now be applied to MDMV-B. as demonstrated by 
Example 4. 

Another approach which may be used to confer plant virus resistance is to express 
the gene of the target virus in the host plant (e.g. WO 94/18336 to Turner et al. for potato 
leaf roll virus and WO 91/13542 to Zaitlin et al. for tobacco mosaic virus; herein 
incorporated by reference in their entirety). This approach may also be applied to MDMV-B 
using the information provided by the present invention. 

For resistance strategies which depend upon expression of a viral replicase coding 
sequence in a transgenic plant, a cDNA clone encompassing nucleotides 5745 to 7307 of 
SEQ ID No: 1, contemplated to include the active domains of the MDMV-B can be used for 
plant transformation. More preferably, such strategies may be employed by transforming a 
plant with larger expressible fragments of the MDMV-B genome contemplated to 
encompass the entire replicase protein. In this case, the MDMV-B replicase would be 
cleaved from the encoded polypeptide when exposed to MDMV-B viral proteinase in the 
plant cell. 

The MDMV-B replicase coding sequence may be engineered for recombinant 
expression in a monocotyledonous host plant which is normally susceptible to infection by 
MDMV-B. Expression of MDMV-B replicase in such a monocotyledonous host plant is 
contemplated to confer resistance to (i.e. inhibit) MDMV-B infection. 

Suitable host plants which may benefit from application of any of the resistance 
approaches described above include any monocotyledonous species which are susceptible 
to infection by MDMV-B. In particular, suitable host plants are contemplated to include 
maize, sorghum and sugarcane. 

To express inhibitory transcripts or proteins derived from the MDMV-B genome in a 
host plant cell, the corresponding coding sequence is operably linked to regulatory 
sequences which cause its expression in the chosen host plant cell. Examples of 
promoters capable of functioning in plants or plant cells, i.e.. those capable of driving 
expression of the associated coding sequences such as MDMV-B CP in plant cells, include 
the cauliflower mosaic virus (CaMV) 19S or 35S promoters and CaMV double promoters; 
nopaline synthase promoters; pathogenesis-related (PR) protein promoters; small subunit of 
ribulose bisphosphate carboxylase (ssuRUBISCO) promoters; plant ubiquitin gene 
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promoters; plant actin gene promoters; plant pith-preferred promoters, and the like. 
Preferred are the rice actin promoter (McElroy et al., Mol. Gen. Genet. 231 : 150 (1 991)). 
maize ubiquitin promoter (EP-A-342 926; Taylor et al.. Plant Cell Rep.12: 491 (1993); Toki 
et al., Plant Phys. 100:1503-1507 (1992)). a maize pith-preferred promoter (WO 93/07278 
incorporated by reference herein in its entirety; in particular see Figure 24 and pages 27- 
28). and the Pr-1 promoter from tobacco. Arabidopsis. or maize (see EP-A-332 104). Also 
preferred are the 35S promoter and an enhanced or double 35S promoter such as that 
described in Kay et al.. Science 236: 1299-1302 (1987) and the double 35S promoter 
cloned into pCGN2113. deposited as ATCC 40587. The promoters themselves may be 
modified to manipulate promoter strength to increase expression of MDMV-B coding 
sequences in accordance with art-recognized procedures. 

The chimeric DNA construct(s) of the invention may contain multiple copies of a 
promoter or multiple copies of a particular coding sequence. In addition, the construct(s) 
may include coding sequences for markers and coding sequences for other peptides such 
as signal or transit peptides, each in proper reading frame with the other functional 
elements in the DNA molecule. The preparation of such constructs are within the ordinary 

level of skill in the art. 

Since the MDMV-B proteins are naturally expressed as part of a polyprotein, each 
protein does not include its own translation initiation and translation stop codon. To express 
such proteins individually in the context of a chimeric gene, a translation initiation codon will 
need to be added immediately adjacent to the first codon if one does not occur as part of 
the coding sequence. The skilled artisan will recognize that addition of such a translation 
initiation codon will add a methionine amino acid to the end of the encoded protein. Such 
an addition is not contemplated to have any significant effect upon the properties of the 
protein. Also, a translation stop codon will need to be added to the chimeric gene 
immediately after the last codon of the protein if one does not naturally occur at this 
location. 

Useful markers include peptides providing herbicide, antibiotic or drug resistance, 
such as. tor example, resistance to hygromycin. kanamycin. G418. gentamycin, lincomycin. 
methotrexate, glyphosate, phosphinothricin, or the like. These markers can be used to 
select cells transformed with the chimeric DNA constructs of the invention from 
untransformed cells. Other useful markers are peptidic enzymes which can be easily 
detected by a visible reaction, for example a color reaction, for example luciferase. 
fi-glucuronidase, or B-galactosidase. 



PCT/EP96/02673 

WO 97/02352 

- 15 - 



Standard recombinant DNA and molecular cloning techniques used in the following 
examples are well known in the art and are described by J. Sambrook, E. F. Fritsch and T. 
Maniatis, Mojgcular Clnninc: A 1 ahnratnrv manual. Cold Spring Harbor laboTatory. Cold 
Spring Harbor. NY (1989) and by TJ. Silhavy. M.L. Berman. and L.W. Enquist, Experiments 
w .th n»ne Fusions . Cold Spring Harbor Laboratory. Cold Spring Harbor. NY (1984). 

Example 1 : Construction of Plant Transformation Vectors 

Numerous transformation vectors are available for plant transformation, and the 
genes of this invention can be used in conjunction with any such vectors. The selection of 
vector for use will depend upon the preferred transformation technique and the target 
species for transformation. For certain target species, different antibiotic or herbicide 
selection markers may be preferred. Selection markers used routinely in transformafon 
include the nptll gene which confers resistance to kanamycin and related antibiotics 
(Messing & Vierra. Gene 19: 259-268 (1982); Bevan et al.. Nature 304:184-187 (1983)). the 
bar gene which confers resistance to the herbicide phosphinothricin (White et at.. Nucl. 

Acids Res. 18: 1062 (1990). Spencer et a... Theor. Appl. Genet. 79: 625-631(1990)). the 

hph gene which confers resistance to the antibiotic hygromycin (Blochinger & Diggelmann. 

Mol Cell Biol 4: 2929-2931 (1984)). and the dhfr gene, which confers resistance to 

methotrexate (Bourouis et al.. EMBO J. 2(7): 1099-1104 (1983)). 

1 Construction of v^tnrs Suitabl e f o r A q r obact erinm Transformation 

Many vectors are available for transformation using Agrobacterium tumefaciens. 
These typically carry at least one T-DNA border sequence and include vectors such as 
PBIN19 (Bevan. Nucl. Acids Res. (1984)). Below the construction of two typical vectors rs 
described. 

1.1. Construction nf pHIB200 and pCIB2001 

The binary vectors pCIB200 and pCIB2001 are used for the construction of 
recombinant vectors for use with Agrobacterium and was constructed in the following 
manner. pTJS75kan was created by Narl digestion of pTJS75 (Schmidhauser & Helinski. J 
Bacteriol. 164: 446-455 (1985)) allowing excision of the tetracycline-resistance gene, 
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followed by insertion of an Accl fragment from pUC4K carrying an NPTII gene (Messing & 
Vierra, Gene 19: 259-268 (1982); Bevan et al.. Nature 304: 184-187 (1983); McBride et al., 
Plant Molecular Biology 14: 266-276 (1990)). Xhol linkers were ligated to the EcoRV 
fragment of pCIB7 which contains the left and right T-DNA borders, a plant selectable 
nos/nptll chimeric gene and the pUC polylinker (Rothstein et al.. Gene 53: 153-161 (1987)), 
and the Xhol-digested fragment was cloned into Sail-digested pTJS75kan to create 
PCIB200 (see also EP-A-332 104. example 19). pCIB200 contains the following unique 
polylinker restriction sites: EcoRI. Sstl, Kpnl, Bglll, Xbal, and Sail. P CIB2001 is a derivative 
of P CIB200 which was created by the insertion into the polylinker of additional restriction 
sites. Unique restriction sites in the polylinker of pCIB2001 are EcoRI. Sstl. Kpnl. Bglll, 
Xbal. Sail. MM. Bell. Avrll. Apal, Hpal. and Stul. P CIB2001. in addition to containing these 
unique restriction sites also has plant and bacterial kanamycin selection, left and right T- 
DNA borders for Agrobacterium-mediated transformation, the RK2-derived trfA function for 
mobilization between E. coli and other hosts, and the OriT and OriV functions also from 
RK2. The pCIB2001 polylinker is suitable for the cloning of plant expression cassettes 
containing their own regulatory signals. 

12 rnnstmction of PCIBIO and Hy gmmvcm Selection Derivatives thereof 

The binary vector pCIBIO contains a gene encoding kanamycin resistance for 
selection in plants. T-DNA right and left border sequences and incorporates sequences 
from the wide host-range plasmid pRK252 allowing it to replicate in both E. coli and 
Agrobacterium. Its construction is described by Rothstein et al., Gene 53: 153-161 (1987). 
Various derivatives of pCIBIO have been constructed which incorporate the gene for 
hygromycin B phosphotransferase described by Gritz et al.. Gene 25: 179-188 (1983)). 
These derivatives enable selection of transgenic plant cells on hygromycin only (pCIB743). 
or hygromycin and kanamycin (pCIB715, pCIB717). 

2 . nnnstmction nf Vectors Suitahle for non -Anrnhacterium Transformation. 

Transformation without the use of Agrobacterium tumefaciens circumvents the 
requirement for T-DNA sequences in the chosen transformation vector and consequently 
vectors lacking these sequences can be utilized in fdition to vectors such as the ones 
described above which contain T-DNA sequences. " Transformation techniques which do not 
rely on Agrobacterium include transformation via particle bombardment, protoplast uptake 
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(e g PEG and electroporation) and microinjection. The choice of vector depends iargeiy on 
the preferred se.ection for the species being transformed. Be.ow. the construction of some 
typical vectors is described. 

2 1 rnnstructin n nf pCIB3064 

PCIB3064 is a pUC-derived vector suitable for direct gene transfer techmques .n 
combination with se.ection by the herbicide basta (or phosphinothricin). The 
PC.B246 comprises the CaMV 35S promoter in operational fusion to the E. col, GUS gene 
and the CaMV 35S transcriptional terminator and is described in WO 93/07278. The 35S 
promoter of this vector contains two ATG sequences 5' of the start site. These sites were 
mu tated using standard PGR techniques in such a way as to remove the ATGs and 
generate the restriction sites Sspl and Pvu.l. The new restriction s.tes were 96 and 37 bp 
away from the unique Sail site and 101 and 42 bp away from the actual start site. The 
resultant derivative of P C.B246 was designated P CIB3025. The GUS gene was then 
excised from pCIB3025 by digestion with Sail and Sad. the termini rendered blunt and re- 
lated to generate p.asmid P C.B3060. The p.asmid pJ.T82 was obtained from the John 
,nnes Centre. Norwich and a 400 bp Sma. fragment containing the bar gene * J 
Streptomyces viridochromogenes was excised and inserted into the Hoa. s.te of pC.B3060 
(Thompson et al. EMBO J 6: 251 9-2523 (1 987)). This generated P C,B3064 wh.ch 
comprises the bar gene under the control of the CaMV 35S promoter and terminator for 
herbLde se.ection. a gene fro ampiciHin resistance (for se.ection in E. co.i) and a 
with the unique sites Sphl. Pstl. Hind.,., and BamHI. This vector is suitable for the clon.ng 
of plant expression cassettes containing their own regulatory s.gnals. 

2 2 r.»nfitrtJCtion o « pROG19 and pSOG35 

PSOG35 is a transformation vector which utilizes the E. coli gene dihydrofolate 
reductase (DHFR) as a selectable marker conferring resistance to methotrexate. PCF I was 
used to amplify the 35S promoter (-800 bp), intron 6 from the maize Adh1 gene (-550 bp) 
and 18 bp of the GUS untranslated leader sequence from pSOGI.O. A 250 bp fragment 
encoding the E. coli dihydrofolate reductase type I. gene was also amplified by PGR and 
these two PCR fragments were assembled with a Sacl-Pstl fragment from pB.221 
(Clontech) which comprised the P UC1 9 vector backbone and the nopa.ine synthase 
terminator. Assembly of these fragments generated P SOG19 which contains the 35S 
promoter in fusion with the intron 6 sequence, the GUS leader, the DHFR gene and the 
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nopaline synthase terminator. Replacement of the GUS leader in pSOG1 9 with the leader 
sequence from Maize Chlorotic Mottle Vims (MCMV) generated the vector pSOG35. 
DSOG19 and pSOG35 carry the pUC gene for ampicillin resistance and have Hindlll, Sphl, 
Pstl and EcoRI sites available for the cloning of foreign sequences. 

Example 2: Construction of Plant Expression Cassettes 

Gene sequences intended for expression in transgenic plants are firstly assembled in 
expression cassettes behind a suitable promoter and upstream of a suitable transcription 
terminator to create a chimeric gene. These expression cassettes can then be easily 
transferred to the plant transformation vectors described above in Example 1 . 

Promoter Selection 

The selection of a promoter used in expression cassettes or chimeric genes will 
determine the spatial and temporal expression pattern of the transgene in the transgenic 
plant. Selected promoters will express transgenes in specific cell types (such as leaf 
epidermal cells, mesophyll cells, root cortex cells) or in specific tissues or organs (roots, 
leaves or flowers, for example) and this selection will reflect the desired location of 
expression of the transgene. Alternatively, the selected promoter may drive expression of 
the gene under a light-induced or other temporally regulated promoter. A further alternative 
is that the selected promoter be chemically regulated. This would provide the possibility of 
inducing expression of the transgene only when desired and caused by treatment with a 
chemical inducer. 

TranscriPtinnal Terminators 

A variety of transcriptional terminators are available for use in expression cassettes. 
These are responsible for the termination of transcription beyond the transgene and its 
correct polyadenylation. Appropriate transcriptional terminators and those which are known 
to function in plants and include the CaMV 35S terminator, the tml terminator, the nopaline 
synthase terminator, the pea rbcS E9 terminator. These can be used in both 
monocotyledons and dicotyledons. 
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Se quences for thn Enhance ment or Regulation of Expression 

Numerous sequences have been found to enhance gene expression from within the 
transcriptional unit and these sequences can be used in conjunction with the genes of this 
invention to increase their expression in transgenic plants. 

Various intron sequences have been shown to enhance expression, particularly in 
monocotyledonous cells. For example, the introns of the maize Adh1 gene have been 
found to significantly enhance the expression of the wild-type gene under its cognate 
promoter when introduced into maize cells. Intron 1 was found to be particularly effective 
and enhanced expression in fusion constructs with the chloramphenicol acetyltransf erase 
gene (Callis et al., Genes Develop. 1: 1183-1200 (1987)). In the same experimental 
system, the intron from the maize bronzel gene had a similar effect in enhancing 
expression (Callis et al., supra). Intron sequences have been routinely incorporated into 
plant transformation vectors, typically within the non-translated leader. 

A number of non-translated leader sequences derived from viruses are also known to 
enhance expression. Specifically, leader sequences from Tobacco Mosaic Virus (TMV, the 
"W-sequence"), Maize Chlorotic Mottle Virus (MCMV). and Alfalfa Mosaic Virus (AIMV) have 
been shown to be effective in enhancing expression (e.g. Gallie et al. Nucl. Acids Res. 15: 
8693-8711 (1987); Skuzeski et al. Plant Molec. Biol. 15: 65-79 (1990)) 

Example 3: Transformation of Monocotyledons 

Transformation of monocotyledon species such as wheat or maize has become 
routine. Preferred techniques include direct gene transfer into protoplasts using PEG or 
electroporation techniques, and particle bombardment into callus tissue. Transformations 
can be undertaken with a single DNA species or multiple DNA species (i.e. co- 
transformation) and both these techniques are suitable for use with this invention. Co- 
transformation may have the advantage of avoiding complex vector construction and of 
generating transgenic plants with unlinked loci for the gene of interest and the selectable 
marker, enabling the removal of the selectable marker in subsequent generations, should 
this be regarded desirable. However, a disadvantage of the use of co-transformation is the 
less than 100% frequency with which separate DNA species are integrated into the genome 
(Schocher et al. Biotechnology 4: 1093-1096 (1986)). 

EP-A-292 435 (to Ciba-Geigy). EP-A-392 225 (to Ciba-Geigy) and WO 93/07278 (to 
Ciba-Geigy) describe techniques for the preparation of callus and protoplasts from an elite 
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inbred line of maize, transformation of protoplasts using PEG or electroporation. and the 
regeneration of maize plants from transformed protoplasts. Gordon-Kamm et al., Plant Cell 
2: 603-618 (1990)) and Fromm et al., Biotechnology 8: 833-839 (1990)) have published 
techniques for transformation of A188-derived maize line using particle bomSardment. 
Furthermore, WO 93/07278 (to Ciba-Geigy) and Koziel et al., Biotechnology 11: 194-200 
(1993)) describe techniques for the transformation of elite inbred lines of maize by particle 
bombardment. This technique utilizes immature maize embryos of 1 .5-2.5 mm length 
excised from a maize ear 14-15 days after pollination and a PDS-1000He Biolistics device 
for bombardment. 

Transformation of rice can also be undertaken by direct gene transfer techniques 
utilizing protoplasts or particle bombardment. Protoplast-mediated transformation has been 
described for Japonica-types and Indica-types (Zhang et al.. Plant Cell Rep 7: 379-384 
(1988); Shimamoto etal. Nature 338: 274-277 (1989); Datta et al. Biotechnology 8: 736-740 
(1990)). Both types are also routinely transformable using particle bombardment (Christou 

et al. Biotechnology 9: 957-962 (1991)). 

EP-A-332 581 (to Ciba-Geigy) describes techniques for the generation, transformation 
and regeneration of Pooideae protoplasts. These techniques allow the transformation of 
Dactylis and wheat. Furthermore, wheat transformation has been described by Vasil et al.. 
Biotechnology 10: 667-674 (1992)) using particle bombardment into cells of type C long- 
term regenerable callus, and also by Vasil et al.. Biotechnology 11: 1553-1558 (1993)) and 
Weeks et al.. Plant Physiol. 102: 1077-1084 (1993) using particle bombardment of 
immature embryos and immature embryo-derived callus. A preferred technique for wheat 
transformation, however, involves the transformation of wheat by particle bombardment of 
immature embryos and includes either a high sucrose or a high maltose step prior to gene 
delivery. Prior to bombardment, any number of embryos (0.75-1 mm in length) are plated 
onto MS medium with 3% sucrose (see Murashige & Skoog. Physiologia Plantarum 15: 473- 
497 (1962)) and 3 mg/l 2,4-D for induction of somatic embryos which is allowed to proceed 
in the dark. On the chosen day of bombardment, embryos are removed from the induction 
medium and placed onto the osmoticum (i.e. induction medium with sucrose or maltose 
added at the desired concentration, typically 15%). The embryos are allowed to plasmolyze 
for 2-3 h and are then bombarded. Twenty embryos per target plate is typical, although not 
critical. An appropriate gene-carrying plasmid (such as pCIB3064 or pSG35) is precipitated 
onto micrometer size gold particles using standard procedures. Each plate of embryos is 
shot with the DuPont Biolistics helium device using a burst pressure of -1000 psi using a 
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standard 80 mesh screen. After bombardment, the embryos are p.aced back into the dark 
to recover tor about 24 h (stil. on osmoticum). After 24 hours, the embryos are removed 
trom the osmoticum and placed back onto induction medium where they stay for about a 
month before regeneration. Approximate* one month iaterthe embryo exp.ants w.th 
developing embryonic ca.ius are transferred to regeneration medium (MS ♦ 1 mg/hter NAA. 
5 mg/Hter GA). further containing the appropriate se.ection agent (10 mg/i basta ,n the case 
of PC.B3064 and 2 mgfl methotrexate in the case of P SOG35) ^J^^^ 
month, deveioped shoots are transferred to .arger steri.e conta.ners known as GA7s wh,ch 
contained ha.f-strength MS, 2% sucrose, and the same concentration of se.ect.on agent. 

Exampie 4: MDMV-B Res.stacne Conferred by Express.on of Translations.* AUered 

Viral Transripts 

Our research has focused on Coning and sequencing the remainder of the MDMV-B 
genome. We have disciosed the majority of the MDMV-B sequence in this application. We 
Le identified coding regions within the MDMV-B coding region based on conserved moffs 
previously identified in other potyviruses. The regions of the virus selected for use as 
ransgenes have been the MDMV-B non-structural proteins (i.e. Replicase. Proteinase, and 
He.icase). These regions were targeted based on the expected higher degree o, sequence 
_tion within these genes among strains of MDMV. We predict that the use of these 
regions W, give the highest probabi.ity of obtaining resistance to mu.tipie stra.ns of MDMV 
when transformed into elite maize inbreds. The sequences have been used to transform 
maize plants for the purpose of conferring virus resistance. 

Maize dwarf mosaic virus strain B (MDMV-B) was obtained from Dr. S. Jensen 
(University of Nebraska-Lincoln) and maintained in a susceptible maize inbred by senal 
inoculation. Virus was prepared for inocu.ation as previously described (see Law. M. D. 
al Phytopathology 79:757-761. (1988)). 

The virus was purified from two week old infected maize tissue by the fol.ow.ng 
protocol. The harvested tissue was homogenized with 0.2 sodium acetate. P H 5.0 
containing 0,% b-mercaptoethano, (1 :6 ratio W:V) in a blende, ~Z7soZL* 
fUtered through cheesecloth and then centrifuged for 15 minutes at 6000 RPM (Sorva GSA 
rotor) The recovered supernatant was then filtered through glass woo, and adjusted to a 
concentration of 0.5% Triton X-100 and 0.2M NaCI. The virus was precipitated from the 
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solution by adding PEG 8000 (8% final concentration) and then stirring for 2 hours at 4_C. 
The virus was recovered by centrifugation for 15 minutes at 8,000 RPM (Sorvall GSA rotor). 

The resulting pellet containing the virus was resuspended by stirring in 0.1 M Tris pH 
6 5 containing 0.032 M sodium citrate. The virus solution was clarified by centrifugation 
through a 20% sucrose pad for 2 hours at 28.000 RPM (SW28 rotor). The recovered pellet 
was resuspended in 10 ml of 0.1 M Tris pH 6.5 containing 0.032 M sodium citrate. The 
supernatent was adjusted to a concentration of 34% cesium sulfate and centrifuged for 14 
hours at 48,000 RPM (Ti 70.1 rotor). The opalescent band containing the virus was 
removed and dialyzed against 0.1 M Tris pH 6.5 containing 0.032 M sodium citrate. Viral 
RNA was isolated from the purified virions by phenol extraction and ethanol precipitation. 

The isolated RNA was then used as template for cDNA preparation using oligo dT 
primers. The preparation of cDNA clones were performed by standard procedures as 
described (see Sambrook. J. et al., Molecular Cloning: A Laboratory Manual". Cold Spring 
Harbor Laboratory Press, (1989)). 

Constructs were prepared to specific regions of the MDMV-B genome by PCR 
amplification from cDNA clones. The region amplified by PCR was typically 1200 to 1400 
nucleotides in length and was confirmed by sequencing. Constructs were prepared to the 
regions of the MDMV-B genome which encode the viral replicase (Nib), proteinase (Nla), 
and cylindrical inclusion protein (CIP). These regions were selected based on the higher 
sequence conservation within these regions between members of the potyvirus family. The 
constructs corresponding to a specific viral coding region were altered during PCR 
amplification by nucleotide substitutions within the primers. A methionine translation 
initiation codon was generated at the first codon preceding the first native codon and a 
termination codon was created at the seventh codon in all constructs tested. This would 
create a mRNA only capable of translating small peptides. The constructs were then ligated 
into either the pUBA plasmid (see Toki et al. Plant Physiol. 100:1503-1507. (1 992)) or the 
PCIB4421 plasmid. The pUBA plasmid contained the Ubiquitin promoter and the NOS 
terminator while P CIB4421 contained the maize phosphoenolpyruvate carboxylase (PEPC) 
promoter and the 35S terminator. The plasmid constructs were then verified by DNA 
sequencing. 

The constructs used in this example to transform maize plants have been designated 
pCIB5018 and pCIB 5019. pCIB5018 was constructed by ligating the PCR amplified Nla 
fragment (nucleotides 4452 -5744 of SEQ ID No. 1) into pCIB4421. The Nla fragment used 
for ligation had previously been altered by insertion of an ATG codon immediately before 
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,he first nucleotide of the first codon (i.e. the G at position 4452 of SEQ ID No. 1) and 
substitution of a thymidine (T) for the adenine (A) at nucleotide 4470 of SEQ ID No. 1 to 
create a premature stop codon. pCIB5019 was constructed by ligating the altered N.a 
fragment described above into the pUBA plasmid. 

MicroproiectilB Bomba rHmpnt Protocols 

Plasmid DN A was precipitated onto 1 mm gold microcarrier particles as described in 
the DuPont Bio.istic manual. 5mg of plasmid DNA containing a synthetic phosphinothrycm 
acetyltransferase selectable marker gene and 5mg of either P C.B5018 or P CIB5019 were 
added per 50ml of prepared microcarrier. The synthetic phosphinothrycin acety.transferase 
selectable marker gene provides resistance to the same selection agents as the BAR gene 
(see Kramer, C. et al. Planta 190: 454-458 (1993)). Bombardment of tissue was earned out 
with the DuPont PDS-1000He Biolistic" device. An additional 150x150mesh/linear inch 
screen was inserted 2cm below the stopping screen. Immature embryos were bombarded 
with 1550 P si rupture discs on a plate angled 6-8cm below the stopping screen to max.m,ze 
scutellum exposure to panicles. Type I callus was placed 4cm below the stopping screen 
and 900 P si rupture discs were used in bombardment. All plates for both explant types were 
bombarded twice. 

Immature Em hr yn Fxnlant S ™"™ Initiation and Selection 

immature embryos of a proprietary Ciba elite line (CG00526) were used as the in.t.al 
explant source in microprojecti.e-mediated transformation. Embryos were excised from the 
ears 10-14 days post-pollination, when 1-2mm in length. After surface sterilization in a 10% 
Clorox solution, embryos were plated embryonic axis down on the surface of the agar- 
solidified medium. Embryos were plated onto Duncan's »D" callus induction medium plus 
5mg/l chloramben. 2% sucrose. 12mM proline and either the organic amendments spec-fed 
in Duncan's (2DG4) or a modified version (2DA1) which omits the casein hydrolysate and 
adds the amino acids minus glutamine and asparagine from Kao and Michayluk's W 
medium (see Kao and Michay.uk. Planta 126:105-110. (1975)). The plated embryos were 
kept in a 25 C dark culture room continuously until the regeneration phase was initiated. 
The day after plating the embryos were transferred to the appropriate G4 or A1 med,a 
containing 12% sucrose at least four hours prior to microprojectile bombardment. Th,rty-s,x 
embryos were arranged in a 2-3 cm circle in the center of the plate. The embryos rema.n 
on the 12% sucrose plate overnight after bombardment. The following day. embryos were 
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transferred either to 2DG4 + 5 chloramben + the equivalent of a 10mg/l concentration of 
Basta"herbicide (glufosinate ammonium) or 2DA1 +5 chloramben + 5mg/l Basta. 

Fourteen days from the initial excision and plating, developing compact, organized 
type I callus was excised from the original explant and subcultured to either 2DG4 + 0.5mg/l 
2,4-D + 10mg/l Basta or 2DA1 +0.5mg/l 2.4-D + 5mg/l Basta. Viable, healthy callus was 
serially subcultured every fourteen days during the selection phase. All tissue was then 
transferred to Duncan's medium, modified by omitting all amino acids, plus 2% sucrose. 
0.5mg/l 2.4-D and 10mg/l Basta (2DG8) at the end of eight weeks. After a two week 
passage on the G8 medium, all living tissue was transferred to regeneration medium. 

Type I Explant Source Initiatio n and Selection 

Immature embryos of the Ciba elite line (CG00526) were plated embryonic axis down 
onto 2DG4 + 5 chloramben at the 1-2mm length size. The developing compact, highly 
organogenic (type I) callus was excised from the original embryo explant after fourteen days 
and maintained serially on 2DG4 + 0.5mg/l 2.4-D by subculturing to fresh medium every 
ten-fourteen days. When the callus lines obtained were two to three months old. they were 
prepared for microprojectile bombardment. The tissue was subcultured to fresh medium in 
small pieces approximately 1-3mm in size one to two days prior to bombardment. On the 
day of bombardment, the tissue was arranged in a 2-3cm circle in the center of a DA1 plate 
containing 12% sucrose and 0.5mg/l 2.4-D at least four hours prior to bombardment. The 
callus was kept on the plate after bombardment overnight, and transferred the next day to 
2DA1 +0.5mg/l 2.4-D + 10mg/IBasta. Viable, healthy callus was serially subcultured on the 
same medium every fourteen days during the selection phase. All tissue was transferred to 
Duncan's medium, modified by omitting all amino acids, plus 2% sucrose, 0.5mg/l 2.4-D and 
10mg/l Basta (2DG8) at the end of eight weeks. After a two week passage on the G8 
medium, all living tissue was transferred to regeneration medium. 

Reneneration Playlet E stablishment of Immature Embryo and Type I Explant Source 
Experiments 

Tissue for regeneration was moved to a 25_C light culture room under a 16 hour 
photoperiod. Regeneration medium consisted of Murashige and Skoog's (MS) salts and 
vitamins. 3% sucrose + 0.25mg/l ancymidol. 1.0mg/l NAA . 0.5mg/l kinetin and Smg/I Basta. 
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After a two week passage on the regeneration medium with growth regulators, the tissue 
was transferred to MS medium + 3mg/IBasta and no additional growth regulators. Plantlets 
reaching 14cm length were transferred from plates to Magenta«GA-7 boxes containing MS 
medium (0.75X concentration* 1% sucrose) and no Basta for root development. Plantlets 
with sufficient root development were transplanted to soil and moved to the greenhouse. 
Plantlets were hardened off in a 70% humidity phytotron for one to two weeks before 
moving the plants to the greenhouse range. The greenhouse conditions were as follows: 
55% humidity, 400 Einsteins light intensity. 16 hour photoperiod. 80-84_F Day temperature, 
64-68_F Night temperature. Plants were allowed to grow to maturity in the greenhouse and 
were either selfed or backcrossed to the parental line in the Ti generation. 



Analysis of Tn Plants 

To plantlets were first assayed by polymerase chain reaction (PCR) to detect the 
selectable marker, the gene of interest and an alcohol dehydrogenase (Adh) gene 
sequence as an internal assay control. Plantlets were assayed at approximately eight to 
fourteen cm height, when the plantlets were still in the GA-7 boxes. Standard PCR 
conditions were used (see Kramer. C. et al. Planta 190: 454-458 (1993)). The Adh internal 
control primer pair sequence was TGCATGTCGGTTGTGTTGCA (SEQ ID NO. 3) and 
CTCAGCAAGTACCTAGACCA (SEQ ID No. 4). The primer pair sequence for the synthetic 
PAT gene was TGTCTCCGGAGAGGAGACC* (SEQ ID No. 5) and 
CCAACATCATGCCATCCACC (SEQ ID No. 6). The primer pair sequence for the Nla 
proteinase gene is GCGGGATCCATGGGGAAGAACAAACGCAGTTGA (5') (SEQ ID No. 7) 
and GCGGAGCTCTTACTCTTCAACGCTCGCGTC (3') (SEQ ID No. 8). The parameters for 
PCR amplification for all primer pairs were 45 sec at 94 _C. 30 sec at 62_C. 30 sec at 72_C 
plus a 3 sec/cycle extension elongation for 40 cycles. 

Plantlets identified by PCR to be transformed were analyzed by Northern blot assay 
for mRNA transcript of the gene of interest (Nla proteinase). Plants were assayed for 
mRNA expression either while in the GA-7 containers or when the plants had been 
acclimated in the greenhouse. The probe was a 1 303 bp fragment of the Nla gene excised 
by a BamH1/Sacl restriction digest of the pCIB5019 plasmid. Labeling was carried out with 
the Gibco/BRL RadPrime DNA Labeling kit as described by the manufacturer. Northern blot 
protocols were performed as described (see Sambrook, J. et al.. "Molecular Cloning: A 
Laboratory Manual". Cold Spring Harbor Laboratory Press. (1989)). 
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Analysis of Ti Plants 

Ti seed harvested from the T 0 plants was first dried down in the drying room for one 
to two weeks before planting. Seed were planted directly in flats and watered in. The flats 
were bottom watered with either a 0.15% volume/volume Basta solution or with water two 
days after planting. Four different transformation events were tested for herbicide and 
disease resistance in this example, as well as the wild-type elite control. Forty seeds from 
each individual transformed plant were tested initially. 20 in Basta and 20 in the water 
control. Seven days after the first Basta drench, a second drench was carried out in the 
same manner. 

All plants were inoculated with MDMV-B following the second Basta soil drench when 
the plants were 4-5 inches in height (3-5 Leaf Stage). A second virus inoculation was 
performed on all plantlets 4-6 days after the first inoculation to insure infection. Plants were 
scored for viability in the plus and minus Basta drench and for the presence or absence of 
viral symptoms at the end of two and a half weeks. 

Plants which showed resistance to the virus, as measured by the absence of viral 
symptoms, and a susceptible sibling were assayed by Northern blot analysis using the NIa 
fragment as described above. The resistant plants were also assayed by ELISA and 
Western blot analysis for the presence of MDMV-B coat protein in the plants. 

ELISA and Western Bint analysis o f the transgenic plants. 

The primary antibody used for both assays was a polyclonal antibody specific for the 
MDMV-B coat protein which was obtained from Dr. S. Jensen (University of Nebraska- 
Lincoln). The second antibody was an affinity purified polyclonal IgG alkaline phosphatase 
labeled goat anti-rabbit antibody (Kirkegard and Perry Laboratories, Gaithersburg. 
Maryland). 



ELISA Analysis 



Tissue samples were taken from all plants not exhibiting characteristic MDMV-B 
symptoms and from one infected plant. Samples were also taken from healthy and infected 
CG00526 plants as controls. The samples (two leaf punches- 1 cm in diameter) were taken 
from both the inoculated leaf and the youngest available leaf. The tissue samples were 
homogenized in 0.400 ml of borate buffered saline (100mM boric acid. 25mM sodium 
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borate. 75mM sodium chloride). Aliquots (50ml) of each sample were applied to a ethanol 
washed ELISA plate and incubated overnight at 4_C. The plates were then washed once 
with ELISA wash buffer (lOmM Tris-HCI. 0.05% Tween-20, 0.02% sodium azide). and 
blocked with ELISA block/diluent (10mM sodium phosphate, 140mM sodium chloride, 
0 05% Tween-20, 1% BSA, 0.02% sodium azide) for one hour at room temperature. The 
plates were washed three times with ELISA wash buffer. The primary antibody was applied 
at a 1 :5000 dilution in 50ml of ELISA block/diluent and incubated for 2 hours at 37_C and 
then washed three times with ELISA wash buffer. The second antibody was applied at a 
concentration of 1 .5mg/ml in ELISA block/diluent and incubated for 2 hours at 37_C. The 
plates were washed three times with ELISA wash buffer and were developed by incubation 
in ELISA substrate (Kirkegard and Perry) for 30 minutes at room temperature. The reaction 
was stopped by the addition of 50ml of 3M sodium hydroxide. The plates were read with a 
SLT 340 ATTC ELISA plate reader (SLT Labinstruments) at 405nm. 



Western Pint Analysis 



Western blot analysis was performed on samples used for ELISA analysis. A 2ml 
aliquot of the samples was diluted into 10ml of 1X loading dye (Novex Inc). The samples 
were electrophoresed on an 8%-16% Tris-glycine polyacrylamide gel (Novex) in Tris running 
buffer (25mM Tris-Base, 192mM glycine. 0.1%SDS) at 120 volts for approximately 2.5 
hours The gel was blotted onto nitrocellulose using a Biorad blotting apparatus in transfer 
buffer (25mM Tris-Base. 192mM glycine, and 10% methanol) at 120 volts for 45 minutes. 
The filter was blocked with blocking/diluent (1X TBS. (20mM Tris-Base, 500mM NaCI. pH 
7 5) 0.05% Tween-20. 1% BSA. 5% lamb serum) at room temperature for 45 minutes. The 
filter was incubated with the primary antibody, described above, at a dilution of 1 :1 000 in 
blocking/diluent at room temperature for 1 .25 hours. The filter was washed for five minutes 
in 1XTTBS. (1X TBS. 0.05% Tween-20). The second antibody, described above, was 
incubated with the filter in blocking/diluent at a dilution of 1 :1 000. for 1 .25 hours at room 
temperature. The filters were washed twice for 5 minutes in 1XTTBS followed by a single 
wash in 1XTBS for 5 minutes. The filter was developed with Nitro Blue Tetrazolium (NBT) 
and 5-bromo-4-chloro-3-indolyl phosphatase (BCIP) in 0.1M Tris-HCI pH 9.5 as described 
by the manufacturer. The filter was developed for approximately 20 minutes and then 
stopped by washing the filter with water. 
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r-hararterizati nn nf thfi MDMV-B Genome 

Clones have been isolated and sequenced representing 8530 nucleotides of the 
MDMV-B genome. We have identified a single large open reading frame as-would be 
expected of a virus belonging to the potyvirus family. We have identified regions of the 
nonprotein which would encode the coat protein (nucleotides 7308-8291 of SEQ ID No. 1 
and amino acids 2436-2763 of SEQ ID No. 2). the putative RNA dependent RNA 
polymerase (RdRp) termed Nib (nucleotides 5745-7307 of SEQ ID No. 1 and amino acids 
1915-2435 of SEQ ID No. 2), the Nla proteinase (nucleotides 4452-5744 of SEQ ID No. 1 
and amino acids 1484-1914 of SEQ ID No. 2). the 6K protein (nucleotides 4293-4451 of 
SEQ ID No 1 and amino acids 1431-1483 of SEQ ID No. 2). cylindrical inclusion prote.n 
(CP) containing the helicase(nucleotides 2376-4292 ol SEQ ID No. 1 and amino acids 792- 
1430 of SEQ ID No. 2). P3 proteinase (nuc.eotides 1 134-2375 of SEQ ID No. 1 and ammo 
acids 378-791 of SEQ ID No. 2). and a portion of the helper components proteinase (HC- 
Pro)(nucleotides 3-1 133 of SEQ ID No. 1 and amino acids 1-377 of SEQ ID No. 2). 
Identification was based on the location of putative cleavage sites and conserved motifs. 
The MDMV-B sequence of the CP region from our isolate was 99% identical to the 
previously sequenced MDMV-B CP and 78% identical to the MDMV-A CP. Further 
comparisons could not be made due to the lack of additional sequence to other MDMV 
strains The sequence of MDMV-B was then compared to other potyviruses and was found 
to exhibit approximately 60% nucleotide sequence identity to other potyviruses. The level 
of identify varied little when sequences encoding the different proteins were used for the 
comparison. 



T n Analysis 

Eighteen lines (individual transformation events from selection and regeneration) were 
obtained from the experiments in this example. 17 of the 1 8 .ines were positive by PCR fo, 
the selectable marker, and 14 for the gene of interest. All 14 events which were PCR 
positive for the Nla gene were also positive for expression in the Northern ana.ys,s. The 
predominate mRNA species was approximately 1300 nucleotides in length which would 
correspond to the predicted size of the transgene. A smaller species approximately 1000 
nucleotides in length was also detected which most likely arose by processing. Differences 
in mRNA expression levels were seen between different events as well as between 
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individual plants (siblings) from a given event. All PCR positive plants were used for seed 
production (T-|). 

T-| Analysis . 
Four plants from two different events were identified to be resistant to the virus inoculation 
as evidenced by the absence of visual symptoms. There was no correlation to Basta 
tolerance in this example. Northern analysis of the four plants showed no detectable NIa 
transcript in the four resistant plants, while an infected sibling plant from the same ong.nal 
ear (T 0 ) was shown to have high levels of viral RNA. The levels of MDMV-B in the infected 
sibling was similar to the levels seen in the control CG00526 plants. 

The resistant plants were also evaluated for the presence of viral coat protein by 

ELISA. 

The four values obtained for each sample, duplicate samples from the inoculated leaf and 
non-inoculated leaf, were averaged and a comparison made against the infected and 
healthy controls. No detectable virus was present in the resistant transformed plant lines by 
ELISA at which the threshold of detection was approximately 2 ng of virus per sample. In 
contrast, the transformed siblings which exhibited symptoms contained levels of virus 
similar to that seen in the infected CG00526 control plants. These results show conclude 
evidence that the four plants were immune to MDMV-B infection (i.e. not supporting virus 
replication). The resistance was durable in that the resistant plants withstood two 
inoculations with high MDMV-B inoculum concentrations. The inoculum concentrations 
used in these experiments typically result in symptoms within four days in susceptible plant 
lines. Yet, the resistant plants have not produced visible symptoms nor detectable virus six 
weeks following inoculation. 



Various modifications of the invention described herein will become apparent to those 
skilled in the art. Such modifications are intended to fall within the scope of the appended 
claims. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION : 



(i) APPLICANT: 

(A) NAME: CIBA-GEIGY AG 

(B) STREET: Klybeckstr. 141 

(C) CITY: Basel 

(E) COUNTRY: Switzerland 

(F) POSTAL CODE (ZIP) : 4002 

(G) TELEPHONE: +41 61 69 11 11 

(H) TELEFAX: + 41 61 696 79 76 

(I) TELEX: 962 991 



t±±\ TITLE OF INVENTION: USE OF TRANSLATIONALLY ALTERED RNA TO CONFER 
(XI) TITLE OF HWh^ixuN. RESiffrANCE TO mqsAIC VIRUS AND OTHER 

^O^OCXnYI£[X)NOUS PLANT VIRUSES 



(iii) NUMBER OF SEQUENCES: 8 



(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC canpatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30B 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENCTH: 8543 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS : single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: RNA (genomic) 



(iii) HYPOTHETICAL: NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 3.. 8291 
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(D) OTHER INFORMATION : /product = "polyprotein encoded by 
MEMV-B genome" 

(ix) FEATURE: 

(A) NAME/KEY: 3'UTR 

(B) LOCATION: 8292.. 8530 

(ix) FEATURE: 

(A) NAME/KEY: misc_RNA 

(B) LOCATION: 3.. 1133 

(D) OTHER INFORMATION : /product = "3 -prime sequence for 

HC-Pro" 

(ix) FEATURE: 

(A) NAME/KEY: miscJRNA 

(B) LOCATION: 1134.. 2375 

(D) OTHER INFORMATION: /product = *'P3 proteinase 

(ix) FEATURE: 

(A) NAME/KEY: misc_RNA 

(B) LOCATION: 2376.-4292 t 

(D) OTHER INFORMATION: /product= "cylindrical inclusion 

protein" 

(ix) FEATURE: 

(A) NAME/KEY: misc_RNA 

(B) LOCATION: 4293.. 4451 

(D) OTHER INFORMATION: /product = "K2 (6kD protein) " 

(ix) FEATURE: 

(A) NAME/KEY: miscJRNA 

(B) LOCATION: 4452.. 5744 

(D) OTHER INFORMATION : /product = "NIa proteinase" 

(ix) FEATURE: 

(A) NAME/KEY: misc_RNA 

(B) LOCATION: 5745.. 7307 

(D) OTHER INFORMATION: /product= "Nib replicase" 

(ix) FEATURE: 

(A) NAME/KEY: misc_KNA 

(B) LOCATION: 7308.. 8291 

(D) OTHER INFORMATICS : /product = "coat protein" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1 : 

UC GAA GAG AAA CAA CGA GAG UAU CUU GCA AAG GAU CAA AAA CUA UCC 
Glu Glu Lys Gin Arg Glu Tyr Leu Ala Lys Asp Gin Lys Leu Ser 
1 5 10 15 

AGA AUG AUA CAA UUU AUC AAA GAA AGG UGC AAU CCA AAA UCG CAU 

Arg Met He Gin Phe He Lys Glu Arg Cys Asn Pro Lys Phe Ser His 

20 25 30 



47 



95 
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UUA CCA ACG CUA UGG CAA GUC GCG GAA ACA AUA GGG CAC UAU ACU GAU 143 
Leu Pro Thr Leu Trp Gin Val Ala Glu Thr lie Gly His Tyr Thr Asp 

35 40 45 

AAC CAG UCA AAG CAA AUA AUG GAU GUU AGC GAA GCG CUC AUC AAA GUU 191 
Asn Gin Ser Lys Gin lie Met Asp Val Ser Glu Ala Leu He Lys Val 
50 55 60 

AAU ACU CUG ACU CCU GAU GAU GCU AUG AAA GCA AGC GCA GCG UUA CUU 239 
Asn Thr Leu Thr Pro Asp Asp Ala Met Lys Ala Ser Ala Ala Leu Leu 
65 70 75 

GAA GUG UCG CGA UGG UAU AAG AAU CGU AAG GAG UCA CUC AAA ACU GAC 287 
Glu Val Ser Arg Trp Tyr Lys Asn Arg Lys Glu Ser Leu Lys Thr Asp 
80 85 90 95 

UCA UUG GAA UCU UUU AGA AAU AAA AUA UCA CCA AAG AGU ACA AUA AAU 335 
Ser Leu Glu Ser Phe Arg Asn Lys He Ser Pro Lys Ser Thr He Asn 

100 105 HO 

GCA GCU UUA AUG UGC GAU AAU CAA UUG GAU AAA AAU GCA AAU UUU GUA 383 
Ala Ala Leu Met Cys Asp Asn Gin Leu Asp Lys Asn Ala Asn Phe Val 

115 120 125 

UGG GGU AAU AGG GAA UAC CAC GCC AAA CGA UUU UUC GCA AAC UAU UUU 431 
Trp Gly Asn Arg Glu Tyr His Ala Lys Arg Phe Phe Ala Asn Tyr Phe 
130 135 140 

NAA GCA GUG GAU CCC ACA GAU GCA UAU GAA AAG CAC GUC ACA CGG UUC 479 
Xaa Ala Val Asp Pro Thr Asp Ala Tyr Glu Lys His Val Thr Arg Phe 
145 150 155 

AAC CCU AAU GGU CAA CGA AAG UUA UCA AUA GGA AAG UUA GUU AUC CCA 527 
Asn Pro Asn Gly Gin Arg Lys Leu Ser He Gly Lys Leu Val He Pro 
160 165 170 175 

CUA GAC UUU CAA AAG AUU AGA GAA UCA UUC GUU GGA CUC UCG AUA AAU 575 
Leu Asp Phe Gin Lys He Arg Glu Ser Phe Val Gly Leu Ser He Asn 

180 185 190 

AGA CAA CCG CUG GAU AAA UGU UGU GUU AGC AAG AUC GAA GGA GGG UAU 623 
Arg Gin Pro Leu Asp Lys Cys Cys Val Ser Lys He Glu Gly Gly Tyr 

195 200 205 

AUA UAC CCA UGU UGC UGC GUC ACA ACA GAA UUU GGU AAA CCA GCA UAC 671 
He Tyr Pro Cys Cys Cys Val Thr Thr Glu Phe Gly Lys Pro Ala Tyr 
210 215 220 

UCU GAG AUA AUA CCU CCA ACG AAA GGG CAU AUA ACA AUA GGC AAU UCU 719 
Ser Glu He He Pro Pro Thr Lys Gly His He Thr He Gly Asn Ser 
225 230 235 

AUU GAU UCA AAG AUU GUG GAC UUG CCA AAU ACA ACA CCA CCC AGC AUG 767 
He Asp Ser Lys He Val Asp Leu Pro Asn Thr Thr Pro Pro Ser Met 
240 C 245 250 255 
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Tyr lie Ala Lys Asp Gly Tyr Cys Tyr lie Asn He Phe Leu ALa ai 

260 255 

AUG AUC AAC GUU AAJJ GAA GAA UCU GCC AAG GAU UAU AOS AAA UUU UUG 
Met lie Asn Val Asn Glu Glu Ser Ala Lys Asp Tyr Thr Lys Phe Leu 

275 260 

AGG GAC GAA CUA GUU GAG CGU CUC GGA AAG UGG CCA AAG CUU AAA ^C 
S S Su SS Val Glu Arg Leu Gly Lys Trp Pro Lys Leu Lys Asp 



GUAGCAACAGCGUGUUAUGCAUUAUCUC^ADGUUUCCAGAAA^ 
S S tS Ala Cys Tyr Ala Leu Ser Val Met Phe Pro Glu lie Lys 

305 310 315 



AAU GCU GAG CUA CCU CCA AUU CUA GUU GAC CAU GAA AAU AAA UCA AUG 
^ Sa Glu Leu Pro Pro lie Leu Val Asp His Glu Asn Lys Ser Met 
320 325 

340 345 J 

AAA GCA AGC ACG AJJU GGU CAA UUA AUC AAA UUU CAA UAU GAG UCU AUG 
£i Sa Sr S 2 Gly Gin Leu He Lys Phe Gin Tyr Glu Ser Met 

355 360 



370 375 
APA UUC AAC Ai^ CUU CUU AAG AUG CUU ACG AAA AAC AUG UUC AAA CCA 
£ pS A^ tS Su Leu Lys Met Leu Thr Lys Asn Met Phe Lys Pro 



385 



390 



GAG CGC AUC AAG CAG AUA AUU GAA GAG GAA CCU UUC UUA CUU AUG ?P? 
Glu Arg Ue Lys Gin lie He Glu Glu Glu Pro Phe Leu Leu Met Met 



400 



405 



GCG AUU GCG UCU CCA ACG GUA UUA AUA GCA CUA UAU AAU AAU UGU UAU 
S S £a Sr S Thr Val Leu He Ala Leu Tyr Asn Asn Cys Tyr 



420 



425 



AITT! GAG CAA GCU AUG ACA UAC UGG AUC GUU AAG AAU CAA GGA GUU GCA 
S Su Sn Sa Met Thr Tyr Trp He Val Lys Asn Gin Gly Val Ala 

435 440 

rw- mia imp GCA CAA CUC GAA GCA UUA GCC AAG AAA ACA UCC CAG GCU 
S S £ Sa S Su Glu Ala Leu Ala Lys Lys Thr Ser Gin Ala 



450 



815 



863 



911 



959 



1007 



1055 



1103 



1151 



1199 



1247 



1295 



1343 



1391 



1439 
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465 470 475 

UUAAC^UUAGCAC^UCAQGAC^AGCCAUAUCGACC^ 
Leu Arg Leu Ala Val Ser Gly Leu Ser His He Asp Pro Ala Lys Arg 
480 485 490 495 



1487 



1535 



1583 



CUUUUGUGGUC^CACOJUC^Qrc 

Leu Leu Trp Ser His Leu Glu Ala Met Ser Thr Arg Ser Glu Met Asn 

500 505 510 

AAG GAG UUA AUA GCU GAG GGG UAU GCA CUA UAU GAC GAG CGC CUA UAC 
Lys Glu Leu He Ala Glu Gly Tyr Ala Leu Tyr Asp Glu Arg Leu Tyr 

515 520 525 

AOC CUG AUG GAA AAA AGU UAC GUA GAIJ CAA UUA AAC CAA UCA UGG GCA 1631 
Thr Leu Met Glu Lys Ser Tyr Val Asp Gin Leu Asn Gin Ser Trp Ala 
530 535 540 

GAA UUG UCA UAC UGU GGA AAA UUU UCA GCA AUA UGG CGU GUG UUC AGA 1679 
Glu Leu Ser Tyr Cys Gly Lys Phe Ser Ala He Trp Arg Val Phe Arg 
545 550 555 

GUC AGG AAG UAU UAC AAA CCG UCU UUA ACC GUG AGA AAA AGC GUA GAU 1727 
Val Arg Lys Tyr Tyr Lys Pro Ser Leu Thr Val Arg Lys Ser Val Asp 
560 565 570 575 

UUA GGC GCU GUA UAC AAU AUA UCA GCU ACG CAU CUA AUA UCA GAU UUA 1775 
Leu Gly Ala Val Tyr Asn He Ser Ala Thr His Leu He Ser Asp Leu 

580 585 590 

GOG CGG AAA AGU CAA GAU CAA GUC AGC UCU AUU UUA ACC AAA CUC CGC 1823 
Ala Arg Lys Ser Gin Asp Gin Val Ser Ser He Leu Thr Lys Leu Arg 

595 600 605 

AAC GGU UUU UAU GAU AAA UUA GAG AAA GUU AGA AUA CGA ACU AUA AAA 1871 
Asn Gly Phe Tyr Asp Lys Leu Glu Lys Val Arg He Arg Thr He Lys 
610 615 620 

AQG GUU UAU UGG UUU AUA CCU GAU AUA UUU AGA CUC GUG CAC AUA UUC 1919 
Thr Val Tyr Trp Phe He Pro Asp He Phe Arg Leu Val His He Phe 
625 630 635 

AUA GUU UUG AGU UUA UUA ACU ACC AUC GCU AAC ACU AUC AUA GUA ACU 1967 
He Val Leu Ser Leu Leu Thr Thr He Ala Asn Thr He He Val Thr 
640 645 650 655 

AUG AAU GAC UAC AAG AAA UUG AAG AAG CAA CAA AGA GAA GAC GAA UAU 2015 
Met Asn Asp Tyr Lys Lys Leu Lys Lys Gin Gin Arg Glu Asp Glu Tyr 

660 665 670 

GAA GCA GAA AUU AGC GAA GUU CGC AGA AUC CAU UCU ACC UUA AUG GAA 2063 
Glu Ala Glu He Ser Glu Val Arg Arg He His *er Thr Leu Met Glu 

675 680 - 685 

GAG CGG AAG GAC AAU CUG ACG UGU GAA CAA UUU AUU GAG UAU AUG CGU 2111 
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Glu Arg Lys Asp Asn Leu Thr Cys Glu Gin Phe He Glu Tyr Met Arg 

690 69 5 700 

C^AAOC^CCACXKCUAGUUGGAGNAACAt^GACUUGACUCAC^ 2159 
Xaa Asn His Pro Arg Leu Val Gly Xaa Thr I^u Asp Leu Thr His Thr 
705 710 715 

GGU GUC AUA CAU GAA GGG AAA UCC AAU CUC GAA ACC AAU UUG GAA CAG 2207 
Gly Val He His Glu Gly Lys Ser Asn Leu Glu Thr Asn Leu Glu Gin 
720 725 730 735 

UCA AUG GCA GUU GGA ACC UUG AUA ACA AUG AUA CUU GAU OCA CAG AAA 2255 
Ser Met Ala Val Gly Thr Leu He Thr Met He Leu Asp Pro Gin Lys 

740 745 750 

AGC GAU GCU GUC UAU AAG GUG UUG AAC AAA AUG CGG ACA GUA AUU AGU 2303 
Ser Asp Ala Val Tyr Lys Val Leu Asn Lys Met Arg Thr Val He Ser 

755 760 765 

ACA AUU GAA CAA AAC GUC CCA UUC CCU UCA GUG AAU UUC UCC AAC AUC 2351 
Thr He Glu Gin Asn Val Pro Phe Pro Ser Val Asn Phe Ser Asn He 
770 775 780 

UUA ACA CCU CCA GUG GCA CAA CAG AGU GUA GAU GUU GAU GAG CCA UUA 2399 
Leu Thr Pro Pro Val Ala Gin Gin Ser Val Asp Val Asp Glu Pro Leu 
785 790 795 

ACA CUU AGC ACU GAU AAA AAU UUA ACA AUA GAC UUU GAC ACA AAU CAA 2447 
Thr Leu Ser Thr Asp Lys Asn Leu Thr He Asp Phe Asp Thr Asn Gin 
800 805 810 815 

GAU UUA CCU GCC GAU ACA UUC M33 AAU GAU GUG ACA UUU GRA GAU U3G 2495 
Asp Leu Pro Ala Asp Thr Phe Ser Asn Asp Val Thr Phe Xaa Asp Trp 

820 825 830 

UGG UCA WMU CAA UUA AGC AAC AAC AGA ACA GUG SCA CAC UAC CGA CNU 2543 
Trp Ser Xaa Gin Leu Ser Asn Asn Arg Thr Val Xaa His Tyr Arg Xaa 

835 840 845 

UGG GGG GAA AGU YCA UUG GAA UUC ACA CGA GAA AAC GCA GCC CAC ACG 2591 
Trp Gly Glu Ser Xaa Leu Glu Phe Thr Arg Glu Asn Ala Ala His Thr 
850 855 860 

AGCAUCGAACUUC^CACUCAAACAUUGAGAGGGAAUUCUUGCW 2639 
Ser He Glu Leu Ala His Ser Asn He Glu Arg Glu Phe Leu Leu Arg 
865 870 875 

GGA GCA GUC GGC UCG GGA AAA UCC ACU GGG UUA CCA UAC CAU CUU AGC 2687 
Gly Ala Val Gly Ser Gly Lys Ser Thr Gly Leu Pro Tyr His Leu Ser 
880 885 890 895 

AUG CGC GGA AAA GUG CUG CUA CUA GAG CCU ACA AGA CCG CUA GCU GAG 2735 
Met Arg Gly Lys Val Leu Leu Leu Glu Pro Thr Arg Pro Leu Ala Glu 

900 905 91 
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AAC GUG UGU AGG CAA CUA CAA GGA CCG CCA UUU AAC GUA AGU CCA ACU 2783 
Asn Val Cys Arg Gin Leu Gin Gly Pro Pro Phe Asn Val Ser Pro Thr 

915 920 925 

CUUCAAAIXJCGUGGAUUAAGUUCCUIXJ 2831 
Leu Gin Met Arg Gly Leu Ser Ser Phe Gly Cys Thr Pro lie Thr lie 
930 935 940 

AUG ACA UCU GGU UUC GCA UUG CAC AUG UAC GCA AAU AAU CCA GAU AAA 2879 
Met Thr Ser Gly Phe Ala Leu His Met Tyr Ala Asn Asn Pro Asp Lys 
945 950 955 

ALIA UCU GAG UAC GAU UUC AUA AUC UUU GAU GAA UGU CAU AUA AUG GAA 2927 
lie Ser Glu Tyr Asp Phe He He Phe Asp Glu Cys His He Met Glu 
960 965 970 975 

GCA CCA GCG AUG GCC UUU UAU UGU UUA CUC AAA GAA UAU GAA UAU CGA 2975 
Ala Pro Ala Met Ala Phe Tyr Cys Leu Leu Lys Glu Tyr Glu Tyr Arg 

980 985 990 

GGA AAA AUU AUC AAG GUA UCA GCU ACG CCU CCA GGA AGG GAG UGU GAA 3023 
Gly Lys He He Lys Val Ser Ala Thr Pro Pro Gly Arg Glu Cys Glu 

995 - 1000 1005 

UUC ACA ACA CAA CAU CCA GUA GAC AUC CAU GUU UGU GAG AAU CUA ACU 3071 
Phe Thr Thr Gin His Pro Val Asp He His Val Cys Glu Asn Leu Thr 
1010 1015 1020 

CAG CAA CAG UUU GUU AUG GAA CUC GGG ACU GGU UCA ACC GCA GAU GCU 3119 
Gin Gin Gin Phe Val Met Glu Leu Gly Thr Gly Ser Thr Ala Asp Ala 
1025 1030 1035 

ACG AAG UAC GGA AAU AAU AUC UUA GUU UAU GUA GCA AGC UAU AAU GAC 3167 
Thr Lys Tyr Gly Asn Asn He Leu Val Tyr Val Ala Ser Tyr Asn Asp 
1040 1045 1050 1055 

GUC GAU UCA UUG UCG CAA GCA CUA GUC GAA CUU AAA UUU UCC GUA AUC 3215 
Val Asn Ser Leu Ser Gin Ala Leu Val Glu Leu Lys Phe Ser Val He 

1060 1065 1070 

AAA GUG GAU GGC CGA ACA AUG AAA CAA AAC ACA ACA GGA AUC AUU ACA 3263 
Lys Val Asp Gly Arg Thr Met Lys Gin Asn Thr Thr Gly He He Thr 

1075 1080 1085 

AAC GGU ACC GCA CAA AAG AAG UGU UUU GUU GUC GCA ACG AAU AUA AUU 3311 
Asn Gly Thr Ala Gin Lys Lys Cys Phe Val Val Ala Thr Asn He He 
1090 1095 1100 

GAG AAU GGC GUC ACA CUA GAU AUU GAU GUU GGU GUC GAC UUC GGA CUU 3359 
Glu Asn Gly Val Thr Leu Asp He Asp Val Gly Val Asp Phe Gly Leu 
1105 1110 H15 

AAA GUC UCA GCU GAC UUG GAC GUU GAC AAC AGG GCG GUA UUG UAU AAA 3407 
Lvs Val Ser Ala Asp Leu Asp Val Asp Asn Arg Ala Val Leu Tyr Lys 
li20 H25 H30 H35 
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Arg val Ser He Sec Tyr Gly Glu Leu Ilefiln Arg Leu Gly ArgVal 

rxv* AGA AAU AAA CCU GGU ACA GUU AUU CGA AOC GGA AAA ACA AUG AAA 
S S US 5^ S Gly ttr Val lie Arg He Gly Lys *r Met Lys 

1155 1160 11D 



3455 



GGU UUG CAG GAA AUU CCA GCA AUG AUC 
Glv Leu Gin Glu lie Pro Ala Met lie 
1170 I* 75 

UGU UUC GCU UAC GGU CUU AAA GUU AUC 
Cys Phe Ala Tyr Gly Leu Lys Val lie 
1185 H90 

CAU CUU GCA AAG UGC ACA GUU AAA CAA 
His Leu Ala Lys Cys Thr Val Lys Gin 
1200 I 205 

GAA UUA UCA CCA UUU GUC AUG GCU GAG 
Glu Leu Ser Pro Phe Val Met Ala Glu 

1220 



GCA ACA GAG GCA GCC UUC AUG 
Ala Thr Glu Ala Ala Phe Met 

1180 

ACU CAU AAU GUU UCA ACG ACC 
Thr His Asn Val Ser Thr Thr 
1195 

GCG AGA ACC AUG AUG CAA UUU 
Ala Arg Thr Met Met Gin Phe 
1210 l 2i5 

CUC GUU AAG UUU GAU GGU UCA 
Leu Val Lys Phe Asp Gly Ser 
1225 1230 



AUGC*UCC*CAAAUACAUC^GCACUAG 

Set His Pro Gin lie His Glu Ala l*u Val Lys Tyr Lys Leu Arg Asp 

12 35 1240 

UCU GUC AUA AUG CUC AGA CCG AAU GCA CUU CCA AGG GUC AAU ^A CAU 
Ser Val lie Met Leu Arg Pro Asn Ala Leu Pro Arg Val Asn L*u His 
1250 1255 1260 



AAU UGG CUU ACA 
Asn Trp Leu Thr 
1265 

CUC GAA GAC CAC 
Leu Glu Asp His 
1280 

AAG UUG UAU GGA 
Lys Leu Tyr Gly 



GCC CGA GAU UAU AAU AGA AUA GGA UGU UCA UUA GAA 
Ala Arg Asp Tyr Asn Arg He Gly Cys Ser Leu Glu 
1270 I 275 

GUC AAA AUU CCG UAC UAC AUU AGG GGA GUU CCU GAC 
Val Lys He Pro Tyr Tyr He Arg Gly Val Pro Asp 
1285 1290 1295 

AAG CUA UAU GAU AUU AUC UUA CAG GAU AGU CCA ACU 
Lys Leu TVr Asp lie lie Leu Gin Asp Ser Pro Thr 
1300 ~ 1305 1310 



AGU UGC UAC AGU AGA CUA UCA AGU GCG UGU GCA GGU AAA GUA G^IA 
SSS S *rg Leu Ser Ser AlaCys Ala Gly Lys Val^Ala Tyr 

ACU CUG CGA ACU GAU CCA UUU UCA CUU CCA AGA ACA AUA GCA AUA AUU 
Thr Leu Arg Thr Asp Pro Phe Ser^Leu Pro Arg Thr IleAla lie He 

AAU GCC UYA AUC ACG GAG GAG UAU GCG AAG AGA GAU CAC UAU CGU AAC 
S S 52 He Jhr Glu Glu Tyr Ala Lys Arg Asp His Tyr Arg Asn 
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3551 



3599 



3647 



3695 



3743 



3791 



3839 



3887 



3935 



3983 



4031 



4079 
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1345 



1350 



1355 



AUG AUU YCA AAC CCA UCU UCA UCA CAC GCA UUC UCA CUC AAU GOG UUG 
S S 3S JSS S Ser Ser Ser His Ala Phe Ser nu Asn Gly Lju 
1360 1365 1370 

GUG UCU AUG A1JC GCU ACQ AGA UAIJ AUG AAA GAC CAC ACA AAG GAG 
Val Ser Met lie Ala Thr Arg Tyr Met Lys Asp His Thr Lys Glu ^Asn 

1380 I 385 

lie Asp Lys Leu He Arg Val Arg Asp Gin Leu Leu Glu Phe Gin Gly 

2395 1400 

ACU GGA AUG CAA UOU CAA GAU CCA UCA GAA CUC AUG GAA AUU GOG GCU 
Thr Gly Met Gin Phe Gin Asp Pro Ser Glu Leu Met Glu lie Gly Ala 
1410 1^15 1420 

OT CAACACAGUUAUUCACCAAGGAA1X;C^C^ 

Leu Asn Thr Val He His Gin Gly Met Asp Ala lie Ala Ala Cys He 
1425 1430 1435 

GAG UUA CAA GGA CGA U3G AAU GCU UCA CUU AUA CAA CGC GAU CUC CUA 
gSu Leu S£ S Arg Trp Asn Ala Ser Leu lie Gin Arg Asp Leu Leu_ 
1440 1445 1450 

AUU GCA GGU GGA GUU UUU AUC GGA GGC AUU UUG AUG AUG UGG AGC CUA 
S Ala Gly Gly Val Phe He Gly Gly He Leu Met Met Trp Ser Leu 

1460 I 465 

UUU ACU AAA UGG AGJJ AAC ACA AAU GUC UCA CAU CAG GGG AAG AAC AAA 
Phe Thr Lys Trp Ser Asn Thr Asn Val Ser His Gin Gly Lys Asn Lys 

I.475 1480 148b 

CGC AGU AGA CAA AAA CUU OGA UUC AAA GAA GCA AGA GAC AAC AAA UAU 
Sg S Arg Gin Lys Leu Arg Phe Lys Glu Ala Arg Asp Asn Lys Tyr 
1490 1495 1500 

QCA UAU GAU GUC ACA. GGA UCG GAA GAA UGC CUU GGC GAG AAU UUU 
Ala Tyr Asp Val Thr Gly Ser Glu Glu Cys Leu Gly Glu Asn Phe Gly 
1505 1510 l 5i5 

ACA GCC UAU ACA AAG AAA GGU AAA GGA AAA GGA ACU AAA GUU GGA CUC 
?£ Sa Tyr Thr Lys Lys Gly Lys Gly Lys Gly Thr Lys Val Gly Leu 
1520 1525 1530 • 

GGU GUG AAG CAG CAU AAA UUC CAU AUG AUG UAC GGU UUC GAU CCC CAA 
Sy Val Lys Gin His Lys Phe His Met Met Tyr Gly Phe Asp Pro Gin 

1540 1545 

C^UACAACC^AUUa^UUUGTC 

Glu Tyr Asn Leu He Arg Phe Val Asp Pro Leu Thr Gly Ala Tta L^u 
y 1555 1560 1565 

GAU GAA CAA AUC CAU GCC GAU AUA CGC UUA AUU CAA GAG CAC UUC GCU 



4127 



4175 



4223 



4271 



4319 



4367 



4415 



4463 



4511 



4559 



4607 



4655 



4703 
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Asp Glu Gin lie His Ala Asp Ile^Arg Leu lie Gin GluHis 



Phe Ala 



1570 

GAA AUU QGU 
Glu He Arg 
1585 

AUU UAC GGC 
He Tyr Gly 
1600 

GCA AAC GCU 
Ala Asn Ala 



1575 



GAG GAG GCA GUG AUU AAU GAC ACA AUU GAA AGG 
Glu Glu Ala Val He Asn Asp Thr He Glu Arg 

1590 1595 

AAU CCU GGA CUA CAA GCA UUU UUC AUA CAA AAU 
Asn Pro Gly Leu Gin Ala Phe Phe He Gin Asn 
1605 1610 

CUGAGAGUUGAUUUAACACCACAUUC^OTJA^ 
Leu Arg Val Asp Leu Thr Pro His Ser Pro Thr 
1620 1625 



CAG CAG 
Gin Gin 



GGG UCA 
Gly Ser 
1615 

CGA GUU 
Arg Val 
1630 



1635 1640 X 



CGU CAG ACU GGA ACA GCU AUA ACU AUA CCC AUU GGU CAA GUC 
Arg Gin Gly Thr Ala He Thrlle Pro lie Gly Gin Val 

GCA AAU GAA GCA GOG GUU GCA CAC GAG OCA AAA UCC AU3 AB3 
SaSSu Ala Gly Val Ala His Glu Ser Lys Ser Met Met 
1665 1670 "75 

UUG GGU GAU UAC ACA CCA AUA UCG CAA CAA UUG UGU CUA GUA 
Su S 2p Tyr Thr Pro He Ser Gta Gin Leu Cys Leu Val 
1680 1685 1690 

GAC UCG GAU GGG GUA AAG CGG AAU GUA UUU UCU AUU GGA UAU 
Asp Ser Asp Gly Val Lys Arg Asn Val Phe Ser He Gly Tyr 

1700 1705 



CCA AUC 
Pro He 



AAC GGG 
Asn Gly 



CAA AAU 
Gin Asn 
1695 

GGC UCA 
Gly Ser 
1710 



UAU CUU AUU UCA CCA GCG CAC UUA UUC AAA UAC AAC AAU GOT GAA AUA 
Tyr Leu lie Ser Pro Ala His Leu Phe Lys Tyr Asn Asn Gly Glu lie 

1715 1720 



ACA AUU AGA UCA UCA AGA GGA UUG UAC 
Thr He Arg Ser Ser Arg Gly Leu Tyr 
1730 1735 

UUA AAA UUA CAU CCA AUU GCA CAC AGA 
Leu Lys Leu His Pro He Ala His Arg 
1745 "50 

CCA AAG GAU UUC CCA CCG UUC CCA AUG 
Pro Lys Asp Phe Pro Pro Phe Pro Met 
1760 1765 

UCA CGA GAU AUG CGA GUC UGC CUA GUA 

Ser Arg Asp Met Arg Val Cys Leu Val 

1780 



AAA AUU CGU AAU UCU GUG GAU 
Lys He Arg Asn Ser Val Asp 

1740 

GAC AUG GUC AUA AUU CAA CUC 
Asp Met Val He He Gin Leu 
1755 

CGC UUG AAA UUC GAA CAA CCA 
Arg Leu Lys Phe Glu Gin Pro 
1770 1775 

GGA GUC \AC UUC CAA CAG AAU 
Gly Val \sn Phe Gin Gin Asn 
1785 I 790 
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Tyr Ser Thr Cys lie Val Ser Glu Ser Ser Val Thr Ala Pro Lys Gly 
* 1795 1800 1805 

AAU QGA UUU OSG AAA CAU UGG AUA UCA ^ OTC Q3U CAA ^ 

Asn Gly Asp Phe Trp Lys His Trp He Ser Thr Val Asp Gly Gin Cys 
1810 1815 1820 

GGA CUA CCA UUG GUA GAU ACU AAG AGC AAA CAU AUU GUC GGA AUU CAU 
Gly Leu Pro Leu Val Asp Thr Lys Ser Lys His lie Val Gly He Has 
1825 1830 1835 

Ser Leu Ala Ser Thr Ser Gly Asn Thr Asn Phe Phe Val Ala Val Pro 
1840 1845 1850 . 1855 



5423 



GAG AAC UUU AAU GAA UAC AUC AAU GGA CUC GUG CAA GCA AAU AAA UGG 
Su i£ Phe Asn Glu Tyr He Asn Gly Leu Val Gin Ala Asn Lys Trp 

I860 1865 1» 70 

GAA AAA GGA UGG CAC UAU AAU CCG AAU CUC AUA UCC UGG UGU GGA CUA 
Glu Lys Gly Trp His Tyr Asn Pro Asn Leu lie Ser Trp Cys Gly Leu 

1875 1880 188b 

AAUUUAGUUGAUUC^GCCCCAAAAGGUUUGUUUAAAACGUCAAAAUUG 
Asn Leu Val Asp Ser Ala Pro Lys Gly Leu Phe Lys Thr Ser Lys Leu 
1890 1895 1900 

GUA GAA GAC UUG GAC GCG AGC GUU GAA GAG CAA UGC AAG AUC ACC 
Val Glu Asp Leu Asp Ala Ser Val Glu Glu Gin Cys Lys lie Thr Glu 
1905 1910 1915 

ACA UGG CUC ACA GAG CAA UUA CAA GAU AAU UUA CAA GUG GUU GCG AAA 
Thr Trp Leu Thr Glu Gin Leu Gin Asp Asn Leu Gin Val Val Ala Lys 
1920 1925 1930 "35 

Cys So Sy Gin Leu Val Thr Lys His Val Val Lys Gly Gin Cys Pro 

1940 1945 19i0 

CAC UUU CAA UUG UAC UUA UCA ACA CAU GAC GAU GCU AAA GAA UAC UUC 
His Phe Gin Leu Tyr Leu Ser Thr His Asp Asp Ala Lys Glu Tyr Phe 

1955 1960 1965 

GCA CCC AUG CUU GGA AAA UAC GAC AAG AGU AGG CUU A^C AGA GCA 
Ala Pro Met Leu Gly Lys Tyr Asp Lys Ser Arg Leu Asn Arg Ala Ala 
1970 1975 19 80 

UUU AUC AAA GAC AUA UCA AAA UAU GCA AAA CCA AUU UAC AUU GGA GAA 
Phe lie Lys Asp lie Ser Lys Tyr Ala Lys Pro lie Tyr lie Gly Glu 
1985 1990 1995 

AUC GAG UAU GAU AUC UUU GAU AGA GCU GUA CAG CGG GUU GUC AAU AUU 
lie Glu Tyr Asp He Phe Asp Arg Ala Val Gin Arg Val Val Asn lie 
2000 2005 2010 2Ui:> 



5471 



5519 



5567 



5615 



5663 



5711 



5759 



5807 



5855 



5903 



5951 



5999 
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COC AAA AAU GUU AUG CAA « « « S S S S S SI 
Leu Lys Asn Val Gly Met Gin Gin Cys Val Tyr Val Tnr Asp 

2020 2025 

GAA AUU UUC AGA UCA CUU AAC CUG Ala Ala Val Gly Ala Leu Tyr 

Glu lie Phe Arg Ser Leu Asn Leu Asn Ala Ala Val Gly Al^ 

2035 2040 

^^^AAaAAAAA.^^^^^UC.A^G^aACA^ 

Thr Gly Lys Lys Lys Asn Tyr ^Glu Asn Pne ^ 



2050 



Tlrr CGU AUU UAC AAU GGS CAA CUU 

2065 2070 

GGC GUA UGG AAD GGA UCG CUC Ala Glu lie Arg Pro lie Glu Lys 
Gly Val Trp Asn Gly Ser Leu Lys Ala Glu U£hrg Pro ^ 
2080 2085 

— S2S2SSS5ESSS22SS 

Thr Met Leu Asn Lys Thr Arg inr «ws 2110 



» ^ Ti a Tv rrr- TJGC GUG GAU GAU UUU AAU AAU CAA UUC UAU 

S £&£ E S S S «-■-•-»-£,"'• «- 

2115 2120 

222SSS2SSSS2££=S 

2130 2135 

Gly Gly Trp Asn Arg Leu Leu Glu Lys Leu wro wu^ 
2145 2150 

Cys Asp Ala Asp Gly Ser Gin Phe Asp Ser Ser^Leu xnr y ^ 

2160 2165 

He Asn Ala Val Leu Asn lie Arg 2igQ 



rrv nar ACU GAG AUU GUU UAC ACA 
Aim GGA GCG CAA A^ Om AAG AAC UAC ACU GAG 

lie Gly Ala Gin Met Leu Lys Asn Leu Tyr Thr Glu V yr 

2195 2200 

^ ^ f-ir oa UCA AUC GUG AAG AAA UUC AAA GGU AAC 
CCA AUC GCA ACG CCA GAC GGA UCA AUC GLK» ^ ^ 

Pro He Ala Thr Pro Asp Gly Ser lie vao. y ^ 
2210 2215 
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6479 
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6623 
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2225 



2230 



2235 



AUA GCU UUC AAC UAU GOC AUG CUA UCA AGU GGU AUC AAA GAA GAA GAA 
Se" Phe Asn Tyr Ala Met Leu Ser Ser Gly He Lys Glu Glu Glu 
2240 2245 2250 2255 

AUC GAU AAU UGC UGU AGA AUG UUC GCG AAU GGU GALJ GAC UUA CUC CUA 
lie Asp Asn Cys Cys Arg Met Phe Ala AsnGly Asp Asp Leu Leu Leu 

2260 2265 2270 

^GUGCAUCOTGAUUUU^OJCHJUmGAUGWOT^MUCAC 

Ala Val His Pro Asp Phe Glu Phe He Leu Asp Glu Phe Gin Asn His 

2275 2280 2285 

UUUGQGAAUCUUOKJCUGAACUUCGAAUUUACAUCACGAACACGAGAU 
Phe Gly Asn Leu Gly Leu Asn Phe Glu Phe Thr Ser Arg Thr Arg Asp 
2290 2295 2300 

AAA UCC GAA CUG UGG UUC AUG UCC ACA AGA GGC AUC AAG UAU GAA GGA 
££s 2r Glu Leu Trp Phe Met Ser Thr Arg Gly lie Lys Tyr Glu Gly 
2305 2310 2315 

AUU UAC AUA CCA AAG CUU GAG AAA GAA AGA AUA GUC GCC AUA CUU GAA 
He Tyr He Pro Lys Leu Glu Lys Glu Arg He Val Ala He Leu Glu 
232 0 2325 2330 2335 

UGG GAU CGA UCA AAC UUG CCU GAA CAU AGG UUG GAA GCU AUA UGU GCA 
■Etp Asp Arg Ser Asn Leu Pro Glu His Arg Leu Glu Ala He Cys Ala 

2340 2345 -«= u 

GCG AUG GUU GAG GCC UGG GGA UAU UCC GAU CUC GUU CAU GAA AUA CGA 
Ala Met Val Glu Ala Trp Gly Tyr Ser Asp Leu Val His Glu lie Arg 

2355 2360 2365 

AAG UUC UAU GCG UGG CUU UUG GAA AUG CAA CCU UUU GCA AAU CUC GCA 
Lys Phe Tyr Ala Trp Leu Leu Glu Met Gin Pro Phe Ala Asn Leu Ala 
2370 2375 2380 

AAA NAA GOG UUG GCC CCA UAC AUU GCC GAG ACA GCA CUC CGC AAU CUC 
Lys Xaa Gly Leu Ala Pro Tyr He Ala Glu Thr Ala Leu Arg Asn Leu 
2385 2390 2395 

UAU CUU GGA ACG GGU AUC AAA GAG GAA GAA AUU GAA AAA UAU CUU AAA 
Tyr Leu Gly Thr Gly He Lys Glu Glu Glu He Glu Lys Tyr Leu Lys 
2^00 2405 2410 2415 

CAA UUC AUU AAG GAU CUU CCC GGA UAC AUA GAA GAU UAC AAU GAA GAU 
Gin Phe He Lys Asp Leu Pro Gly Tyr He Glu Asp Tyr Asn Glu Asp 

2420 2425 2430 

C^UIX:CAU<GAGUCGC^AOTGUUC^UGCGGCTG^ 
Val Phe His Gin Ser Gly Thr Val Asp Ala Gly Ala Gin Gly Gly Ser 

2435 2440 2445 

GGA AGC CAA GGG ACA ACA CCA CCA GCA ACA GGU AGU GGA GCA AAA CCA 
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Lys Pro 



ACU GGU 
Thr Gly 



GGA UCU 
Gly Ser 
2495 

CAG GUU 
Gin Val 
2510 



Gly ser Gin Gly Thr <J*r Pro Pro Ala <Thr Gly Ser GjyAla 

2450 24bb 
/^^o 7\rv^ Tira nnr GCA GGA UCU GGU AGU GAC ACA GGA GCU GGA 

JSSSrSSaS Ser Gly Ser Asp T*r GlyAla Gly 
2465 2470 

GUA ACU GGA AGU CAA GCA AGG ACU QGC AGU GGC ACT GGG ACG 
Val Thr Gly Ser Gin Ala Arg Thr Gly Ser Giy^nr y 
2480 2485 

GGA GCA ACC GGA GGC CAA ser Gly Ster G^V 

Gly Ala Thr Gly Gly Gin Ser Gly Ser Gly Ser Ciy 

2500 2505 

ssss=s»==s=ss=£== 

2515 2520 

_ __ _p t]C n QGA AAA AUU UCU GUA CCA AAG CUC AAG 

GAU GUG GAU GCA GGC UCA ACA GGA Val Lys L*U Lys 

Asp Val Asp Ala Gly Ser Tnr ^xyijy ^ Q 
2530 2bJb 

Mir OTP T7UA CCU AAA GCA AAA GGA AAA GAU GUG 
GCC AUG UCA AAG AAA AUG CGC UUA Uvju ™ Val 
Ala Met Ser Lys Lys Met Arg to Pro Lys Ala 
2545 2550 

Leu His Leu Asp Phe Leu Leu Thr W uys n 2 57B 
2560 2565 

S2SSSSS = = = S = SSS,S 

2580 2bH!3 

2595 2600 

T ™ TV2n Mir gaa AAU GGU UGC UCA CCA AAC AUA 

?F SL 52 S S S SS £ Gly Cys Ser Pro Asn I le 
Ser Gly Leu Met Val Trp uys lie 

2610 2&lb 

nrr- mtt r*MT AAA GAU GAA CAA AGG GUC UUC CCA 

2625 2630 

^>rT rrii rim rrA ACU UUC CGA CAA AUU AUG 

2640 2645 

His His Phe Ser Asp Ala Ala Glu Ala Tyr lie y 

2660 26bb 



7439 



7487 



7535 



7583 



7631 



7679 



7727 



7775 



7823 



7871 



7919 



7967 



8015 
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S Su Arg OVr Met Pro Arg Tyr GlyLeu Gin Arg Asn Ljuttr Asp 

2675 2680 

2690 2695 

2710 2715 



2705 



__. fY-tj ri3T UCA AAC ACA CGA CUG UUC QGU UUG GAU GGA AAU GUC GGC 
Val Arg Gly Ser Asn Thr Arg Leu Phe Gly Leu Asp Gly Asn Val Gly 
2720 2725 



GAG ACU CAG GAG AAU ACA GAG AGA GAC 99^ elv Asp Vtel Ser Arg 

Glu Tte Gin Glu Asn Ita Glu Arg Hxa Thr Ala Gly Asp Val Ser Arg 

2740 2745 
AAC AUG CAC UCU CUG UUG GGA GUG CAG CAA CAC CAC UAGUCUCCUG 
Asn Met His Ser Leu Leu Gly Val Gin Gin His His 

2755 2760 

GAAACCCUGU UUGCAGUACC AAUAAUAUSU ACUAAUAUAU AGUAUUUUAG UGAGGUUUUA 
CCUCGUCUUU ACUGUUUUAU UACGUAUGUA UUUAAAGCGU GAACCAGUCU GCAACAUACA 
GOGUUQGACC CAGUGUGUUC UGGUGUAGCG UGUACUAGCG UCGAGCCMJG AGAUGGACUG 
CACUGGGUGU GGUUUUGCCA CUUGUGUUGC GAGUCUCCUG GUAAGAGACA AAAAAAAAAA 



AA 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2763 amino acids 

(B) TYPE: amino acid 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Glu Glu Lys Gin Arg Glu tyr Imx Ala Lys Asp Gin Lys L*u Ser Arg 
! 5 10 

Met lie Gin Phe lie Lys Glu Arg Cys Asn Pro Lys Phe Ser His L*u 

2 5 



20 



Pro Thr Leu Trp Gin Val Ala Glu Thr lie Gly is iyr Ihr Asp Asn 
35 40 45 



8063 



8111 



8159 



8207 



8255 



8301 



8361 
8421 
8481 
8541 
8543 
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Gin Ser Lys Gin lie Met Asp Val Ser Glu Ala Lea He Lys Val Asn 
50 55 60 

Thr Leu Thr Pro Asp Asp Ala Met Lys Ala Ser Ala Ala Leu Leu Glu 
65 70 ">5 80 

Val Ser Arg Trp Tyr Lys Asn Arg Lys Glu Ser Leu Lys Thr Asp Ser 

85 90 9 

Leu Glu Ser Phe Arg Asn Lys He Ser Pro Lys Ser Thr lie Asn Ala 

100 105 110 

Ala Leu Met Cys Asp Asn Gin Leu Asp Lys Asn Ala Asn Phe Val Trp 
U5 120 125 

Gly Asn Arg Glu Tyr His Ala Lys Arg Phe Phe Ala Asn Tyr Phe Xaa 
130 135 I 40 

Ala Val Asp Pro Thr Asp Ala Tyr Glu Lys His Val Thr Arg Phe Asn 
145 150 155 

Pro Asn Gly Gin Arg Lys Leu Ser He Gly Lys Leu Val He Pro Leu 

165 I 70 175 

Asp Phe Gin Lys He Arg Glu Ser Phe Val Gly Leu Ser lie Asn Arg 

180 185 I 90 

Gin Pro Leu Asp Lys Cys Cys Val Ser Lys He Glu Gly Gly Tyr He 
195 200 205 

Tyr Pro Cys Cys Cys Val Thr Thr Glu Phe Gly Lys Pro Ala Tyr Ser 
210 215 220 

Glu He He Pro Pro Thr Lys Gly His He Thr He Gly Asn Ser lie 
225 230 235 240 

Asp Ser Lys He Val Asp Leu Pro Asn Thr Thr Pro Pro Ser Met Tyr 

245 250 z » 

He Ala Lys Asp Gly Tyr Cys Tyr He Asn He Phe Leu Ala Ala Met 

260 265 270 

He Asn Val Asn Glu Glu Ser Ala Lys Asp Tyr Thr Lys Phe Leu Arg 
275 280 285 

Asp Glu Leu Val Glu Arg Leu Gly Lys Trp Pro Lys Leu Lys Asp Val 
290 295 300 

Ala Thr Ala Cys Tyr Ala Leu Ser Val Met Phe Pro Glu He Lys Asn 
305 310 315 

Ala Glu Leu Pro Pro He Leu Val Asp His Glu Asn Lys Ser Met His 

325 330 ■>■" 

Val He Asp Ser Tyr Gly Ser Leu Ser Val Gly Phe His He Leu Lys 
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340 



345 



350 



Ala Ser 



to lie Gly Gin Leu lie Lys Phe Gin <iyr Glu Ser Met Asp 

36b 



355 



360 



Ser Glu Met Arg Glu Tyr He Val Gly Gly to Leu ^ Gin Gin to 

375 jou 



370 



Phe Asn to Leu Leu Lys Met Leu to Lys Asn Met Phe Lys Pro Glu 



385 



390 



395 



Arg lie Lys Gin lie lie Glu Glu Glu Pro Phe Leu L^u Met Met Ala 



405 



410 



lie Ala Ser Pro to Val Leu lie Ala Leu Tyr Asn Asn Cys Tyr lie 



420 



425 



Glu Gin Ala Met to Tyr Trp lie Val Lys Asn Gin Gly Val Ala Ala 



435 



440 



lie Phe Ala Gin Leu Glu Ala I^u Ala Lys Lys to Ser Gin Ala Glu 

455 460 



450 



Leu Leu Val Leu Gin Met Gin lie Leu Glu Lys Ala Ser Asn Gin Leu 



465 



470 



475 



Arg Leu Ala Val Ser Gly Leu Ser His lie Asp Pro Ala Lys Arg I«u 



485 



490 



Leu Trp Ser His Leu Glu Ala Met Ser to Arg Ser Glu Met Asn Lys 



500 



505 



Glu Leu He Ala Glu Gly Tyr Ala L*u Tyr Asp Glu Arg Leu Tyr to 

520 525 



515 



Leu Met Glu Lys Ser Tyr Val Asp Gin Leu Asn Gin Ser Trp Ala Glu 
530 535 540 

Leu Ser Tyr Cys Gly Lys Phe Ser Ala lie Trp Arg Val Phe Arg Val 



545 



550 



555 



Arg Lys Tyr Tyr Lys Pro Ser Leu to Val Arg Lys Ser Val Asp Leu 



565 



570 



Gly Ala Val Tyr Asn He Ser Ala Tta His Leu He Ser Asp Leu Ala 



580 



585 



Arg Lys Ser Gin Asp Gin Val Ser Ser lie Leu to Lys L*u Arg Asn 



595 



600 



Gly Phe Tyr Asp Lys Leu Glu Lys Val Arg lie Arg Thr He Lys to 



610 



615 



Val Tyr Trp Phe lie Pro Asp He Phe Arg Leu Val His lie Phe lie 



625 



630 
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Val Leu Ser Leu Leu Thr Thr He Ala Asn Thr He lie Val Thr Met 

645 650 655 

Asn Asp Tyr Lys Lys Leu Lys Lys Gin Gin Arg Glu Asp Glu Tyr Glu 

660 665 670 

Ala Glu He Ser Glu Val Arg Arg He His Ser Thr Leu Met Glu Glu 
675 680 685 

Arg Lys Asp Asn Leu Thr Cys Glu Gin Phe He Glu Tyr Met Arg Xaa 
690 695 700 

Asn His Pro Arg Leu Val Gly Xaa Thr Leu Asp Leu Thr His Thr Gly 
705 710 715 

Val He His Glu Gly Lys Ser Asn Leu Glu Thr Asn Leu Glu Gin Ser 

725 730 735 

Met Ala Val Gly Thr Leu He Thr Met He Leu Asp Pro Gin Lys Ser 

740 745 750 

Asp Ala Val Tyr Lys Val Leu Asn Lys Met Arg Thr Val He Ser Thr 
755 760 765 

He Glu Gin Asn Val Pro Phe Pro Ser Val Asn Phe Ser Asn He Leu 
770 775 780 

Thr Pro Pro Val Ala Gin Gin Ser Val Asp Val Asp Glu Pro Leu Thr 
785 790 795 800 

Leu Ser Thr Asp Lys Asn Leu Thr He Asp Phe Asp Thr Asn Gin Asp 

805 810 81b 

Leu Pro Ala Asp Thr Phe Ser Asn Asp Val Thr Phe Xaa Asp Trp Trp 

820 825 830 

Ser Xaa Gin Leu Ser Asn Asn Arg Thr Val Xaa His Tyr Arg Xaa Trp 
83 5 840 845 

Glv Glu Ser Xaa Leu Glu Phe Thr Arg Glu Asn Ala Ala His Thr Ser 
850 855 860 

He Glu Leu Ala His Ser Asn He Glu Arg Glu Phe Leu Leu Arg Gly 
865 870 875 880 

Ala Val Gly Ser Gly Lys Ser Thr Gly Leu Pro Tyr His Leu Ser Met 

885 890 895 

Arg Gly Lys Val Leu Leu Leu Glu Pro Thr Arg Pro Leu Ala Glu Asn 

900 905 910 

Val Cys Arg Gin Leu Gin Gly Pro Pro Phe Asn Val Ser Pro Thr Leu 
915 920 925 
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Gin Met Arg Gly Leu Ser Ser Phe Gly Cys Pro lie Thr He Met 
930 935 y4U 

Thr Ser Gly Phe Ala Leu His Met Tyr Ala Asn Asn Pro Asp Lys lie 
945 950 955 

Ser Glu Tyr Asp Phe lie lie Phe Asp Glu Cys His lie Met Glu Ala 

965 970 9/5 

Pro Ala Met Ala Phe Tyr Cys Leu Leu Lys Glu Tyr Glu Tyr Arg Gly 

980 9S5 " u 

Lys lie lie Lys Val Ser Ala Thr Pro Pro Gly Arg Glu Cys Glu Phe 
* 995 1000 I 005 

Thr Thr Gin His Pro Val Asp He His Val Cys Glu Asn Leu Thr Gin 
1010 1015 1020 

Gin Gin Phe Val Met Glu Leu Gly Thr Gly Ser Thr Ala Asp Ala Thr 
1025 1030 1035 

Lys Tyr Gly Asn Asn He Leu Val Tyr Val ^ Ala Ser Tyr Asn Asp^Val 
Asp Ser Leu Ser Gin Ala Leu Val Glu^Leu Lys Phe Ser Val^Ile Lys 



1060 1065 

Thr Thr Gly 

1085 



Val Asp Gly Arg Thr Met Lys Gin Asn Thr Thr Gly lie lie Thr Asn 
1075 1080 1° 85 

Gly Thr Ala Gin Lys Lys Cys Phe Val Val Ala Thr Asn lie He Glu 
1090 1095 HOG 

Asn Gly Val Thr Leu Asp He Asp Val Gly Val Asp Phe Gly Leu Lys^ 
1105 111° 11 

Val Ser Ala Asp Leu Asp Val Asp Asn Arg Ala Val Leu Tyr Lys Arg 

1125 H30 1J - J3 

Val Ser He Ser Tyr Gly Glu Leu He Gin Arg Leu Gly Arg Val Gly 

1140' H45 

Arg Asn Lys Pro Gly Thr Val He Arg He Gly Lys Thr Met Lys Gly 
U55 1160 H65 

Leu Gin Glu He Pro Ala Met He Ala Thr Glu Ala Ala Phe Met Cys 
1170 H* 7 * 1180 

Phe Ala Tyr Gly Leu Lys Val He Thr His Asn Val Ser Thr Thr His 
1185 H90 1195 

Leu Ala Lys Cys Thr Val Lys Gin Ala Arg Thr Met Met Gin Phe Glu 

1205 121° 

Leu Ser Pro Phe Val Met Ala Glu Leu Val Lys Phe Asp Gly Ser Met 
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1220 1225 1230 

His Pro Gin He His Glu Ala Leu Val Lys Tyr Lys Leu Arg Asp 
1235 1240 1^ 

Val lie Met Leu Arg Pro Asn Ala Leu Pro Arg Val Asn Leu His Asn 
1250 1255 I 2 " 

Trp Leu Thr Ala Arg Asp Tyr Asn Arg He Gly Cys Ser Leu Glu Leu 
1265 1270 1275 

Glu Asp His Val Lys lie Pro Tyr Tyr IleArg Gly Val Pro AspLys 

128 5 

Leu Tyr Gly Lys Leu Tyr Asp lie lie Leu Gin Asp Ser Pro Thr Ser 

1300 I 305 

Cys Tyr Ser Arg Leu Ser Ser Ala Cys Ala Gly Lys Val Ala Tyr Thr 
^ 1315 1320 "25 

Lieu Arg Thr Asp Pro Phe Ser Leu Pro Arg Thr lie Ala He He Asn 
1330 1335 1340 

Ala Xaa He Thr Glu Glu Tyr Ala Lys Arg Asp His Tyr Arg Asn Met ^ 
1345 1350 1353 

lie Xaa Asn Pro Ser Ser Ser His Ala Phe Ser Leu Asn Gly Leu Val 

1365 I 370 

Ser Met He Ala Thr Arg Tyr Met Lys Asp His Thr Lys Glu Asn He 

1380 1385 1390 

Asp Lys Leu He Arg Val Arg AspGln Leu Leu Glu Ph^Gl* Gly Thr 

Ser Glu Leu Met Glu 

1410 "IS 



1395 1400 
Gly Met Gin Phe Gin Asp Pro Ser Glu Leu Met Glu lie Gly Ala Leu 



Asn Thr Val He His Gin Gly Met Asp Ala lie Ala Ala Cys He Glu^ 
1425 1430 14 -> 5 

Leu Gin Gly Arg Txp Asn Ala Ser Leu He Gin Arg Asp Leu Leu He 

14 45 1450 

Ala Gly Gly Val Phe He Gly Gly He Leu Met Met Trp Ser Leu Phe 

1460 14 65 if4,u 

Thr Lys Trp Ser Asn Thr Asn Val Ser His Gin Gly Lys^Asn Lys Arg 

Ser Arg Gin Lys Leu Arg Phe Lys Glu Ala Arg Asp Asn Lys Tyr Ala 

1490 1495 
Tyr Asp Val Thr Gly Ser Glu Glu Cys Leu Gly Glu Asn Phe Gly Thr 
1505 1510 1515 
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Ala Tyr Thr Lys Ly S Gly Lys Gly Lys Gly Thr Lys Val Gly Leu Gly 

1525 1530 

Val Lys Gin His Lys Phe His Met Met Tyr Gly Phe Asp Pro Gin Glu 



1540 



1545 



Tyr Asn Leslie Arg Phe Val Ajp^Pro Leu Thr Gly Ala^Thr Leu Asp 

Glu Gin He His Ala Asp lie Arg Leu lie Gin Glu His Phe Ala Glu 
1570 1575 1580 

He Arg Glu Glu Ala Val lie Asn Asp Thr lie Glu Arg Gin Gin Ita 
1585 1590 1595 

Tyr Gly Asn Pro Gly Leu Gin Ala Phe Phe lie Gin Asn Gly Ser Ala 

1605 1610 

Asn Ala Leu Arg Val Asp Leu Thr Pro His Ser Pro Thr Arg Val Val 

1620 I 625 

Thr Gly Asn^Asn lie Ala Gly Ph^Pro Glu Tyr Glu Gly^Thr Leu Arg 

lie Gly Gin Val Pro lie Ala 



Gin Thr Gly Thr Ala lie Thrlle Pro 
1650 



1655 1660 



Asn Glu Ala Gly Val Ala His Glu Ser Lys Ser Met Met Asn Gly Leu 
1665 1670 16^5 

Gly Asp Tyr Thr Pro He Ser Gto Gl* Leu Cys Leu Val Gin Asn Asp 

1685 169° 

Ser Asp Gly Val Lys Arg Asn Val Phe Ser lie Gly Tyr Gly Ser Tyr 

1700 1705 

Leu lie Ser Pro Ala His Leu Phe Lys Tyr Asn Asn Gly Glu lie Thr 
1715 1720 1725 

lie Arg Ser Ser Arg Gly Leu Tyr Lys He Arg Asn Ser Val Asp Leu 
1730 1735 

Lys Leu His Pro lie Ala His Arg Asp Met Val lie lie Gin Leu Pro 
^45 1750 1755 

Lys Asp Phe Pro Pro Phe Pro Met Arg Leu Lys Phe Glu Gin Pro Ser 

1765 . 1770 

Arg Asp Met Arg Val Cys Leu Val Gly Val Asn Phe Gin Gin Asn Tyr 

1780 I 785 

Ser Thr Cys He Val Ser Glu Server Val Thr Ala ProLys Gly Asn 



1795 



1800 
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Gly Asp Phe Trp Lys His Trp He Ser Thr Val Asp Gly Gin Cys Gly 
1810 1815 

Leu Pro Leu Val Asp Thr Lys Ser Lys His lie Val Gly He His Ser 
1825 1830 1835 

Leu Ala Ser Thr Ser Gly Asn Thr Asn She*. Val Ala Val Pr^Glu 

1845 

Asn Phe Asn Glu Tyr lie Asn Gly Leu Val Gin Ala Asn Lys Trp Glu 

1860 1865 18 ' u 

Lys Gly Trp His Tyr Asn Pro Asn Leu lie Ser Trp Cys Gly Leu Asn 
1875 1880 1885 

Leu Val Asp Ser Ala Pro Lys Gly Leu Phe Lys Thr Ser Lys Leu Val 
1890 1895 19°° 

Glu Asp Leu Asp Ala Ser Val Glu Glu Gin Cys Lys lie Thr Glu Thr 
1905 1910 I 915 

Trp Leu Thr Glu Gin Leu Gin Asp Asn LeuGln Val Val Ala Lys^Cys 

1925 

Pro Gly Gin Leu Val Thr Lys His Val Val Lys Gly Gin Cy^Pro His 

1940 1945 

Phe Gin Leu Tyr Leu Ser Thr His Asp Asp Ala Lys Glu Tyr Phe Ala 
1955 1960 1965 

Pro Met Leu Gly Lys Tyr Asp Lys Ser Arg Leu Asn Arg Ala Ala Phe 
1970 1975 1980 

lie Lys Asp lie Ser Lys Tyr Ala Lys Pro lie Tyr lie Gly Glu 11^ 
1985 1990 1995 

Glu Tyr Asp He Phe^Asp Arg Ala Val GlnArg Val Val Asn He_Leu 

Lys Asn Val Gly Met Gin Gta Cys Val Tyr Val Thr Asp Glu Glu Glu 

2020 2025 

lie Phe Arg Ser Leu Asn Leu Asn Ala Ala Val Gly Ala leu Tyr Thr 
2035 2040 2045 

Gly Lys Lys Lys Asn Tyr Phe Glu Asn Phe Ser Ser Glu Asp Lys Glu 
2050 2055 2060 

Glu He Val Met Arg Ser Cys Glu Arg He Tyr Asn Xaa Gin Leu Gly 
2065 2070 2075 

Val Trp Asn Gly Ser Leu Lys Ala Glu He Arg Pro lie Glu Lys Thr 

2085 2090 

Met Leu Asn Lys Thr Arg Thr Phe Thr Ala Ala Pro Leu Glu Thr Leu 
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2100 



2105 2110 



Leu Gly GlyLys Val Cys Val As^Asp Phe Asn Asn Gln_Phe Tyr Ser 



His His Leu 
2130 



Glu Gly Pro Trp Thr Val Gly He Thr Lys Phe Tyr Gly 



2135 2140 



Gly Trp Asn Arg Leu LsuGlu Lys Leu Pro GluGly Trp Val Tyr Cys 



2145 



2150 



2160 



Asp Ala Asp Gly Ser_Gln Phe Asp Ser SerLeu Thr Pro Tyr Leu_Ile 



2165 



Asn Ala Val Leu Asn lie Arg Lou Gin She Met Glu Asp Trp^Asp lie 



2180 



2185 



Gly Ala Gin Met Leu Lys Asn L*u Tyr Tnr Glu lie Val Tyr T*r Pro 
2195 2200 2205 

Ala Thr Pro Asp Gly Ser lie Val Lys Lys Phe Lys Gly Asn Asn 



lie 



2210 



2215 2220 



Ser Gly Gin Pro Ser Thr Val Val Asp Asn Thr Leu Met Val He lie 
2225 2230 2235 

Ala Phe Asn Tyr Ala Met Leu Ser Ser Gly lie Lys Glu Glu Glu lie 

2245 2250 

Asp Asn Cys Cys Arg Met Phe Ala Asn Gly Asp Asp Leu Leu Leu Ala 

2260 2265 

Val His Pro Asp Phe Glu Phe lie Leu Asp Glu Phe Gin Asn His Phe 
2275 2280 2285 

Gly Asn Leu Gly Leu Asn Phe Glu Phe Thr Ser Arg Thr Arg Asp Lys 

23UU 

2290 ZA ** 

Ser Glu Leu Trp Phe Met Ser Thr Arg Gly lie Lys Tyr Glu Gly lie 
2305 2310 2315 

■ 

Tyr lie Pro Lys Leu Glu Lys Glu Arg lie Val Ala lie Leu GluTrp 



2325 



2330 



Asp Arg Ser Asn Leu Pro Glu His Ar^Leu Glu Ala lie CysMa Ala 

Asp Leu Val His Glu 

2355 2360 



2340 2345 
Met Val Glu Ala Trp Gly Tyr Ser Asp Leu Val His GluUe Arg Lys 



Phe Tyr Ala Trp Leu Leu Glu Met Gin Pro Phe Ala Asn Leu Ala Lys 

2370 2375 
Xaa Gly Leu Ala Pro Tyr He Ala Glu Thr Ala Leu Arg Asn Leu Tyr 
2385 2390 
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!« Gly Tnr Gly lie Lys Glu Glu Glu IleGlu Lys Tyr Leu Ly^Gln 



2405 2410 



Phe lie Lys Asp Leu Pro Gly Tyr Ile^Glu Asp Tyr Asn GluAsp Val 



2420 



2425 



Phe His Gin ser Gly Thr Val Asp Ala Gly Ala Gin GlyGly Sex Gly 



2435 



2440 



Ser Gin Gly Thr Thr Pro ProAla Thr Gly Ser GlyAla Lys Pro Ala 



2450 2455 



Thr ser Gly Ala Gly Ser Gly Ser Asp Thr Gly Ala Gly Thr Gly Val 
2465 2470 2475 

Thr Gly Ser Gin Ala Arg Thr Gly Ser Gly Thr Gly Tnr Gly SerGly 

2485 24yu 

Ala Tnr Gly Gly Gin Ser Gly Ser Gly Ser Gly Tnr Glu GlnVal Asn 

2500 250b 
Ito Gly ser Ala Gly Tnr Asn Ala Tnr Gly Gly Gin Arg Asp Arg Asp 



2515 



2520 



Val Asp Ala Gly Ser Tnr GlyLys He Ser Val ProLys Leu Lys Ala 



2530 



2535 



Met Ser Lys Lys Met Arg Lmx Pro Lys Ala Lys Gly Lys Asp Val £u 
2545 2550 2555 

■ t t^ii T^vi Thr Tvr Lys Pro Gin Gin Gin Asp lie Ser 

His Leu Asp Phe Leu Leu inr iyr uy* 

2565 2570 

Asn Tnr Arg Ala Tnr Lys Glu Glu Phe Asp Arg T*p TVr Asp^a lie 

2580 258b 
Lys Lys Glu Tyr Glu He Asp Asp Tnr Gin Met Tnr Val Val Met Ser 



2595 



2600 



Gly Leu Met Val Trp Cys He Glu Asn Gly Cys Ser Pro Asn He Asn 



ofi1 c 2620 
2610 2615 

Gly Asn Trp Tnr Met Met Asp Lys Asp Glu Gin teg Val Phe Pro 
2625 2630 

Lys Pro Val lie Glu Asn Ala Ser Pro Thr Phe Arg Gin lie Met His 

2645 2650 

_ * j, ~\ _ f^-i . , at A Tvr lie Glu Tyr Arg Asn Ser Thr 
His Phe Ser Asp Ala Ala Glu Ala Tyr x±e y 

2660 266b 
Glu Arg Tyr Met Pro Arg Tyr Gly Leu Gin Arg Asn Leu Tnr Asp Tyr 



2675 



2680 



WO 97/02352 



- 54 - 



PCT/EP96/02673 



Ser Leu Ala Arg Tyr Ala Phe Asp Phe Tyr Glu Met Thr Ser Arg Thr 
2690 2695 2700 

Pro Ala Arg Ala Lys Glu Ala His Met Gin Met Lys Ala Ala Ala Val 
2705 2710 2715 2720 

Arg Gly Ser Asn Thr Arg Leu Phe Gly L*u Asp Gly Asn Val Gly Glu 

2725 2730 ^ /JJD 

Thr Gin Glu Asn Thr Glu Arg His Thr Ala Gly Asp Val Ser Arg Asn 

2740 2745 2750 

Met His Ser Leu Leu Gly Val Gin Gin His His 
2755 2760 

(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION : /desc = "first Adh internal control 

primer" 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
TGCATGTCGG TTCTOTTGCA 
(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGOH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "second Adh internal control 

primer" 

(iii) HYPOTHETICAL: NO 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
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20 

CTCAGCAAGT ACCTAGACCA 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "first synthetic PAT gene 

primer n 

(iii) HYPOTHETICAL: NO 



(x i) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

_ 19 
TGTCTCCGGA GAGGAGACC 

(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEENESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "second synthetic PAT gene 

primer" 

(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

_ 20 
CCAACATCAT GCCATCCACC 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 33 base pairs 
<B) TYPE: nucleic acid 

(C) STRANDECNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION : /desc = "first NIa proteinase gene 

primer" 
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(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GCGGGATCCA TGGGGAAGAA CAAACGCAGT TGA 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LHSGfTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDECKESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "second NIa proteinase 

primer" 
(iii) HYPOTHETICAL: NO 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
GCGGAGCTCT TACTCTTCAA CGCTCGCGTC 



30 
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INDICATIONS RELATING TO A DEPOSITED MICROORGANISM 

(PCT Rule 13&is) 



A. The indications 



made below relate to the microorganism referred to in the description 



10 



on page 

^IDENTIFICATION OF DEPOSIT 



, line 



34 



Name of depositary institution Agricultural Research Service 

(HRRIt) 

Address of depositary institution (including postal code and country) 

1815 North University Street 
Peoria, H* 61604 
USA 



Further deposits are identified on an additional sheet Q 
Culture Collection 



Date of deposit 

29 June 1995 (29.06.95) 



Accession Number 
NRRL B-21479 



June x=» \ — ^ ■ 

C. ADP rnONAL INDICATIONS (leave blank if not applicable) T^is infonn^ n is conUoocd on an .ddiUona, sbec, j_J 



We 



request the Kxpert Solution where available 



D. DESIGNATED STATES FOR WHICH 



INDICATIONS ARE MADE (if the indications arc not for all designated Stat*) 



L CPPAPATF FURNISHING OF INDICATIONS (leave blank ifnoi applicable) 



The indications listed below will be submitted to tne 
Number of Deposit') 



international Bureau later (specify the general nature ofthe.n*caiions e.g. t 'Accession 




For receiving Office use only 



This sheet was received with the international application 




Authorized officer 



R.LR. Pather 



Form PCT/RO/134 (July 1992) 



For International Bureau use only 



Q bis sheet was received by the Internationa! Bureau on: 
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What is claimed is: 

1 A chimeric gene comprising a monocotyledonous plant promoter operably linked to a 
nucleotide sequence derived from the genomic sequence of a virus infecfng 
monocotyledoneous p.ants. wherein said nucleotide sequence contains a modif.cat.on 
rendering a messenger RNA transcribed from said nucleotide sequence incapable of 
complete translation. 

2 The chimeric gene of claim 1 wherein said virus is selected from the group consisting of 
a potyvirus, a luteovirus. a tenuiivirus, a carmovirus. a machlovirus. a gemin.vrus and a 
reovirus. 

3. The chimeric gene of claim 2 wherein said virus is a potyvirus. 

4 A chimeric gene comprising a monocotyledonous plant promoter operably linked to a 
nucleotide sequence derived from the genomic sequence of a maize dwarf mosa.c 
virus wherein said nucleotide sequence contains a modification rendering a messanger 
RNA transcribed from said nucleotide sequence incapable of complete translation. 

5. The chimeric gene of claim 4 wherein said virus is maize dwarf mosaic virus strain B. 

6. The chimeric gene of claim 4 wherein said transcribed RNA is capable of translating an 
attenuated peptide of a maize dwarf mosaic virus protein. 

7. The chimeric gene of claim 6 wherein said attenuated peptide is less than 20 amino 
acids in length. 

8. The chimeric gene of claim 4 wherein said transcribed RNA cannot be translated. 

9 The chimeric gene of claim 4 wherein said transcribed RNA sequence does not include 
the translation initiation codon of said maize dwarf mosaic virus, strain B. 

10. The chimeric gene of claim 4 wherein said transcribed RNA sequence encodes a 
portion of a viral protein selected from the group consisting ol a coat protein, a 
proteinase, a replicase. a helicase. a Vpg protein, a 6K protein and a helper 
component. 
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1 1 . The chimeric gene of claim 4 wherein said modification comprises addition of a 
premature stop codon into said transcribed RNA. 

12. The chimeric gene of claim 4 wherein expression of said gene in transgenic maize, 
sorghum or sugarcane inhibits infection of said transgenic plants by maize dwarl 
mosaic virus. 

13. The chimeric gene of claim 12 wherein expression of said gene in transgenic maize 
inhibits infection of the transgenic plants by maize dwarf mosaic virus. 

14. The chimeric gene of claim 5 wherein said transcribed RNA comprises nucleotides 
4452 to 5744 of SEQ ID No. 1 and said modification comprises the substitution of a T 
for the A at position 4470 of SEQ ID No. 1 . 

15. The chimeric gene of claim 14 wherein said modification further comprises the insertion 
of an ATG codon immediately before the G at position 4452 of SEQ ID No. 1 . 

16. The chimeric gene of claim 4 wherein said monocotyledonous plant promoter is 
selected from the group consisting of a maize ubiquitin promoter, a maize actin 
promoter and a maize phosphoenolpyruvate carboxylase promoter. 

17. A method for producing a monocotyledonous plant with an inheritable trait of resistance 
to infection by a maize dwarf mosaic virus comprising transforming said plant with a 
chimeric gene according to claim 4. 

18. A monocotyledonous plant having an inheritable trait of resistance to infection by a 
maize dwarf mosaic virus, wherein said plant comprises a chimeric gene according to 
claim 4. 

19. A chimeric gene comprising a plant promoter operably linked to a nucleotide sequence 
derived from the genomic sequence of maize dwarf mosaic virus strain B encoding a 
viral protein other than a coat protein, wherein transgenic expression of said chimeric 
gene in a plant inhibits infection of said plant with said virus. 

20 The chimeric gene according to claim 19 wherein said viral protein is selected from the 
group consisting of RNA dependent RNA polymerase (RdRp) having the amino acid 
sequence from position 1915 to 2435 of SEQ ID No. 2. Nla proteinase having the 
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amino acid sequence from position 1484 to 1914 of SEQ ID No. 2, heiicase having the 
amino acid sequence from position 792 to 1430 of SEQ ID No. 2. and P3 proteinase 
having the amino acid sequence from position 378 to 791 of SEQ ID No. 2. 

21 . The chimeric gene of claim 20 wherein said viral protein is a replicase. 

22 The chimeric gene of claim 20 wherein said plant promoter is selected from the group 
' consisting of a plant ubiquitin gene promoter, a plant actin gene promoter, and a plant 
pith-preferred promoter. 

23. A method for producing a plant with an inheritable trait of resistance to infection by 
maize dwarf mosaic virus strain B comprising transforming said plant with the chm.no 
gene of claim 19. 

24. A plant comprising the chimeric gene of claim 22. 

25 A method for protecting progeny of a monocotyledoneous parent plant from viral 
' infection comprising transforming said parent plant with a chimeric gene accordmg to 
claim 1 and obtaining progeny plants or breeding said parent plant with a plant 
according to claim 18. 

26. A method according to claim 25. wherein said progeny are protected from infection with 
maize dwarf mosaic virus. 

27. A method according to claim 25. wherein the progeny of maize, sorghum or sugarcane 
plants are protected from viral infection. 
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